BPH6589 41489 bp DNA

Bacteriophage phi-C31 complete genome. 
AJ006589 

AJ006589.1 GI:3947449

C gene; dCMP deaminase; deoxynucleotide monophosphate kinase; DNA 
polymerase; int gene; integrase; large subunit; major capsid 
protein; major tail protein; portal protein; protease; putative; 
repressor; tail fibre protein; tail tape measure protein; 
terminase; transfer RNA-Thr; tRNA-Thr. 
Bacteriophage phi-C31.
Bacteriophage phi-C31
Viruses; dsDNA viruses.
Lambda phage group.

1 (bases 1 to 41489) 

Hendrix,R.W., Smith,M.C., Burns,R.N., Ford,M.E. and Hatfull,G.F.
Evolutionary relationships among diverse bacteriophages and 
prophages: all the world's a phage 

Proc. Natl. Acad. Sci. U.S.A. 96 (5), 2192-2197 (1999) 
99162580 

2 (bases 1 to 41489) 

Smith,M.C., Burns,R.N., Wilson,S.E. and Gregory,M.A.
The complete genome sequence of the Streptomyces temperate phage
straight phiC31: evolutionary relationships to other viruses

Nucleic Acids Res. 27 (10), 2145-2155 (1999) 

99238410 

3 (bases 1 to 41489) 
Smith,M.C.M.

Direct Submission 

Submitted (01-JUN-1998) Smith M.C.M., Genetics, University of
Nottingham, Queens Medical Centre, Nottingham, NG7 2UH, UK 

Location/Qualifiers

source 1..41489
/organism="Bacteriophage phi-C31"
/virion
/strain="Norwich stock"
/db_xref="taxon:10719"
gene 145..585
/gene="31"
CDS 145..585
/gene="31"
/codon_start=1
/transl_table=11
/product="gp31"
/protein_id="CAA07101.1"
/db_xref="GI:3947450"
/db_xref="SPTREMBL:Q37836"
/translation="MSRAKSPELRTGNANAAAESAAPVVYEGRAPRVPAHLKATGKDV
WRNVWQAGMGAYSPDTDRNVILRYCELHDRRADLLSLIEADGYMSEGYNGQPVAHPML
RYVESTEKELRSIETAIGFTPEARMRLGIVAAEARKVAAGPEDF"
complement(640..1170)

/db_xref="SPTREMBL:Q9ZXB1"
/translation="MTKTTRELRFAVGTLEERASDDGRI^^YAYRFNELSHDLGGF
RERIVPGAGAPSLRQNDVYATFNHDTRSLLGRTSSGTLRVGEDREGGWYEIDLPDTTV

GRDVAKLEKRGDLQGSSFTFRVLDGGQRRADDDDPETGLPVREITAMDWELGPWNP 

AYPTTQASLRSIEETLRIGEFAPPTEERDSQPDGDVAPASHFDARSLVRALSK" 
gene 4893..6071
/gene="36"
CDS 4893..6071
/gene="36"
/function="major capsid protein"
/codon_start=1
/transl_table=11
/product="gp36"
/protein_id="CAA07106.1"
/db_xref="GI:3947455"
/db_xref="SPTREMBL:Q9ZWV6"

/translation="MDATTLSANFEARERATAELRSLTDEFAGKEMTAEAREKEERLL
TAVADFDGRIKRGIDAIKATDAVTSLLSGLQGSGSGAQRSADHDDDAVLRAGNLGEAR
SFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGASTFTTSDANPMDFTV
ITGRATAGIVGETAEIPESYPATTQRSMGGFKYGFASVVSYEFATDQVLDLVGFLVSD
AGPAIGDAMGRHFLTGTGTGQPRGILTDATGANAAFGEADADSKVSDALIDLFHEVPS
AYRKNAKFVVNDLRAAQMRKLKDANGQYLWQSALTVGAPDTFNGKVVETDDGMPADKV
LFADLSKYRVRFAGSLRVDRSVDAKFSTDQIVYRFLQRADGLLVDARGAKVLTVTPAA"

mat_peptide 5226..6068
/gene="36"
/product="gp36"
gene 6144..6512
/gene="37"
CDS 6144..6512
/gene="37"
/codon_start=1
/transl_table=11
/product="gp37"
/protein_id="CAA07107.1"
/db_xref="GI:3947456"
/db_xref="SPTREMBL:Q9ZXB0"
/translation="MAYATIEELRALDGLDDSALFSDELLSDAIDFSVETVEAYCGRK
WDTAEDPTPETIRWCVRTLARQYVLDHVSRIPDRALQLQSEFGSIQLAQAGGNWRPTS
LPEVNAKLNLYRVRLPFIFM"
gene 6519..6956
/gene="38"
CDS 6519..6956
/gene="38"
/codon_start=1
/transl_table=11
/product="gp38"
/protein_id="CAA07108.1"
/db_xref="GI:3947457"
/db_xref="SPTREMBL:Q9ZXA9"
/translation="MALIFNAKVRLFEALKANVPGDVQCTFAETGDNSRRKQVWLGAT
VDDDIAPVAMRSGAKPTNVTGYVEAHAVVTTPGNPVDAERAVYGIRDYVKAACAAVNA

DLVSVPGLMDVRPESASVESTETTDGAYSALTVRVRVRGRVYa' 
gene 6989..7975
/gene="39"
CDS 6989..7975
/gene="39"
/function="major tail protein"
/codon_start=1
/transl_table=11
/product="gp39"
/protein_id="CAA07109.1"
/db_xref="GI:3947458"
/db_xref="SPTREMBL:Q9ZXA8"
/translation="MALDASIGIGREDTYGTLSGVVEC

•MQTARADRRNMVNMGGEGELEIDVLDAGAO^^'; 
feEAPSFTAQMVRPGVDGSKVAFKHLGCVA'^^L'I 



iiGYEGQADSWKQTVEPIESVGF 
pAAFDKVTVTDTGGVKTTVLET 

LiTAEVEEAVKLAVTFDFRDATH 

TSTPAEIVEPTYPAEAYPYDWTRTAVELSRAGSWEFDATSLELTGELGMKTDRRFLl^' 
GSELKKKPVRNALPTYEGTLEGEFSAASLGLYDAFVSGEVCSFKATFFGILPGSSLSy 
EAPAIQFTGESPEAATDEVTIHNLPFRVLDPGDGVTPAIKLTYVEPGTPVEP"
gene 7975..8406
/gene="40"
CDS 7975..8406
/gene="40"
/codon_start=1
/transl_table=11
/product="gp40"
/protein_id="CAA07110.1"
/db_xref="GI:3947459"
/db_xref="SPTREMBL:Q9ZXA7"
/translation="MAQRSAYTIQVDGLRQFQRNVRALRDKELNKAVREANKASGEVL
IPQAKHESPDGHRDPKSSKRYRPGKLDKSIKVTASAKGAVIKAGSAARVPYAAAIHFG
YRKRNISANRFLYRAMARKSDWAATYERRIAAWEKYLES"
gene 8406..8687
/gene="41"
gene 8406..8878
/gene="42"
CDS 8406..8687
/gene="41"
/codon_start=1
/transl_table=11
/product="gp41"
/protein_id="CAA07111.1"
/db_xref="GI:3947460"
/db_xref="SPTREMBL:Q9ZXA6"
/translation="MPQRKPAFEIPDDFTLDLKLDSLTIDEIDAIEEITGQPLDSLNK
AGSRRAPMLRAMAYWMKRKFPEIEPADVGKLKLNLKGK7UCPDPTATNA"
CDS join(8406..8648,8648..8878)
/gene="42"
/note="ribosomal slippage"
/codon_start=1
/transl_table=11
/product="gp42"
/protein_id="CAA07112.1"
/db_xref="GI:3947461"
/translation="MPQRKPAFEIPDDFTLDLKLDSLTIDEIDAIEEITGQPLDSLNK
AGSRRAPMLRAMAYWMKRKFPEIEPADVGKLKLNLKGEGEAGPYRDQRVIACARLVS
HFRGLTWSDVRGMELRDFNALVEQMAADIEAERRDTKRAGRGRSGGTERRTPVMT"
gene 8888..11077
/gene="43"
CDS 8888..11077
/gene="43"
/function="putative tail tape measure protein"
/codon_start=1
/transl_table=11
/product="gp43"
/protein_id="CAA07113.1"
/db_xref="GI:3947462"
/db_xref="SPTREMBL:Q9ZXA5"

/translation="MARPIQITIMGDADQLSETLDQASEEVSAFGEQAKGLALAAGGA 

LKGLWQQGLVPAGATADDNGEHFEKAMDVATVLGDEVGPTSNAVGQMLKTGMAKNADE 
AFDILVRGAQEGANKSEDLLDTFNEYGVQFKGIGLDGKTAMGLLSQGLQGGARDADLV 
ADSLKEFGLIVRAGGDEVNAAYKSMGLNGAEMTKAIAQGGPVAKDALDKTLDGLRKIK 
DPAERIATAVTLFGTQAEDMQDALLKLDPSSAVETLGKVDGAAKSAGETMHDNAATKI 
KAFTRGLQTGLVDFIGGTVLPILEKFKPALEGIGSTMATVGGFVSEHSTTFKWAGir 
TAVLLPALIQWGVQSTINAGKAWAWVTSSATAVIESTKQALAHAKWAGWIASGVOA 
GLNAAKWAGWVLMGAQSMIQGARMAAAWLLAMGPI PLI lAAIVGLWLI VANWDKIW 
AYTKKV^^LWDWKKIFhWJ^DLFLNFTGPGLLIKHWDlSlfsATKhrrFh^

AIJ«VVJ^gCGLPGRlLSA7^SLLSAGKRIGGYVIDGIKg|^KLGGFASSIASAVGR 

^^G/va^BCDLLNWATPNKLGWGKLSIDLPDNPIPKIll^PlpASGWTRVGERGPED 

VFLPNGWvRPNHALSGSGGVTVNVQTNTU^PFAI GREVAWALRTS PA" 
gene 11124..11990
/gene="44"
CDS 11124..11990
/gene="44"
/codon_start=1
/transl_table=11
/product="gp44"
/protein_id="CAA07114.1"
/db_xref="GI:3947463"
/db_xref="SPTREMBL:Q9ZXA4"
/translation="MAELSDWTCEYRGLVMGLPDSAISIVGVDGLLTMPDVRSSDLTL
VQRNGLWAGRDYLNGRTVTLTLEVYGRDRAEFTEALNALQAAFMPGVDESPFRFRFPG
AASDRTAFVMARARKRSAPLDLNFAYLTCNMSVELYATSPYIVGDTUU^TVTVRSYKRD
KVPTGLVLPAWPWQIEGQGPAPDDPVSRFTQYGSVAARPSIVITDAASPWLVDDVTG
AFFAIDYDGTWIDSAAETVTNAEGSDIRGLIADGSTWPEYGPGDHRLRLRSRDEYTA
ASASLTWSDRWV"

gene 11990..12976
/gene="45"
CDS 11990..12976
/gene="45"
/codon_start=1
/transl_table=11
/product="gp45"
/protein_id="CAA07115.1"
/db_xref="GI:3947464"
/db_xref="SPTREMBL:Q9ZXA3"
/translation="MSSFAWFQDGVGYGASQLADWQVIATARGGFRHVFKTTSEFLSN
SNQTARTVAVGSGTVLIGGTAGGGTWAWSSGETVAIPAASNTNPRKDLIVARLTTSAA
DGFNGLAIEWQGTPAASPTVPTRPDNAAAIAIVDVPKASTTFTLTVCRTSGQYTDQA
AYGNGSLCIDWAAVLPSPSAFPVGFTLYDSGTNQTWVRLDSGDWFTKDPGPWKKCTPQ
NVQAKDGThATTVTGDLYVRESSLGWELSGQLNFSPSKDLEVLVYVATLPTGITRPTQN
TYGASGQTYGSTSAGGVGRIALMSSGSIEYGCDGVIANLYVNEQFSKSPWNS"
gene 13017..14033
/gene="46"
CDS 13017..14033
/gene="46"
/codon_start=1
/transl_table=11
/product="gp46"
/protein_id="CAA07116.1"
/db_xref="GI:3947465"
/db_xref="SPTREMBL:Q9ZXA2"

/translation="MrEYEVLQIEAKTGDVIATLPVTGrKYGETLNAAGTATVGMPLD 
AADPDTLQPGRSGLWLRDGEPDWGGLLWTTTADLAAGTLTLNASGWHSYYAGRVLHD 
GYERKTDQALLLADWYALCNEDGGIGTDTSRLTTSGRWSRLWTQYELKWAEAISEL 
AEDDGGFYFRYETYWRSVTQVGNRVLKYTPGSASTPFALTHGVNCDVTQVSYDSAAMA 
TRAYAVGADNGNGTKLVGIADNALDMPTKHWQSFSDVKSTESLISKAYAIATAGAAP 
J^^J^'TLYPGAFKPSDFVPGASGWQVDSGYVRVFDDFVITERSTSIDENGTELTNV 

gene 14048..14872
/gene="47"
CDS 14048..14872
/gene="47"
/codon_start=1
/transl_table=11
/product="gp47"
/protein_id="CAA07117.1"
/db_xref="GI:3947466"
/db_xref="SPTREMBL:Q9ZXA1"

/translation="MAINAN7U.SPSLVTELNDLKRRLAAVERKPDVLAKFDRYPPVEW 
^^RGMVSGhTVWSSCGLANVTGLVFDRVEAKFI^!!^^



_ Alitgrsemvrlaafkhdlda 

|^fck:lsasstialtgttsralgtvyvrwlhgt|^fcpaedgaaiytielqhryregp 

^^P&mHLQVYGFTKRENV7^APDGGALGMPNNT^^kvLSETRPNTGIGl^ 

NIJaPWDGSYNISNMHYCVGLPEDRIPTASTGGHWRWRGTNGMKVRDADITEDFNSS" • 
gene 14869..15141
/gene="48"
CDS 14869..15141
/gene="48"
/codon_start=1
/transl_table=11
/product="gp48"
/protein_id="CAA07118.1"
/db_xref="GI:3947467"
/db_xref="SPTREMBL:Q9ZXA0"
/translation="MSLTDITGHADVIGALLIGGFLVYQRVRTGTGNVWRDEAEAQIA
RSQRLSDDLTLLITEVRNLRDENAGLRLKWRSCGVRIVSFATTLTT"

gene 15154..17001
/gene="49"
CDS 15154..17001
/gene="49"
/function="putative tail fibre protein"
/note="contains collagen-like repeats"
/codon_start=1
/transl_table=11
/product="gp49"
/protein_id="CAA07119.1"
/db_xref="GI:3947468"
/db_xref="SPTREMBL:O21976"

/translation="MAIPNEIPTVRVTGTYLGWDGRALKGTVTFTGPGLVTFPESDLF 
lAGPWCTLDETGQI IDANGNVGVRLPATDS PDMNPSEWTYTVKENLTGWGARTYSM 
VLPKDTLNNSVDLADVAPMPTTPTYVAVPGPSAYEVAVAEGFTGTEAQWLDSLVGRG 
AHAFNGSTVPAASLGIJ^GDTYAQFTVSTTLGVSSTTVrmAKSGGVWSKVSDGVRGAA 
WYTNNTGTPSADVPVGDMLLRVDSGDVYQRGASGWDLKGNIKGAKGDKGDTGATGADS 
TVPGPQGPEGPEGPEGPAGPEGPQGPKGDPGTGSVNSVNGDLGPDITLAAADVGAIPV 
ADKGVASGVATLGTDGLVPTDQLPQLADPNAVTSVNTKPGPTVTLTAADVGALATSTK 
NAADGVAPLDSAKRLPIANVPSAVPKNSWTPQALGFEAWSVYPGGWNPVAKYLTPQR 
LYVTGFNITEPTTVNRIVIFARGWGGVSADRFMAGIYKESTTGRGAVVVKSDSVALPQ 
AGQETGAIJVU<1RSTHVGAVPLPIMTVLQPGRYWVTWLQVNGGTADFAFYHVQNEATI 
STANFFMTDSPWTVRAWYVSDKTALPDTLNQAASDVLANHDI PXMALANV" 

gene 17105..18193
/gene="50"
CDS 17105..18193
/gene="50"
/codon_start=1
/transl_table=11
/product="gp50"
/protein_id="CAA07120.1"
/db_xref="GI:3947469"
/db_xref="SPTREMBL:Q9ZX99"
/translation="MSVAKAEVGYHEGRSGGHWNNHQKYSPAVPGLEWSQNQAWCATF
VSWAALQAGESAHYPRTASCATGVNWFRNKGRWSAYPAVGAQVFFGNGGGSHTEICYA
YDADYAYTVGGNTNTNGSAEGDGVYLRKRARRDSYLYGYGYPDVAGGSVSADPDASKF
GYKHKATGDVGDVGGSTTPDKPAEPSGAVARYKVTINGLEYGYGARGAQVTQVGKALV
AKGFGDAYKDGPGPNWSDADTTNYAAFQRSLGYSGKDADGVPGEGSLTKLLGKLPAKA
KPKASYEPFPGAAWFKKNPKSPIVTAMGKRLVAEGCSAYRSGPGAQWTNADKASYAKW
QRKRGYSGADADGWPGKTTWDALKVPKV"
gene 18300..18479
/gene="51"
CDS 18300..18479
/gene="51"
/codon_start=1
/transl_table=11
/product="gp51"
/protein_id="CAA07121.1"
/db_xref="GI:3947470"
/translation="MGDHSKPGGNYARAALSWAKAHPKV^^IwALVGVATAVKPDF
PGNAVLSTVHAILGA"
gene 18543..19109
/gene="52"
CDS 18543..19109
/gene="52"
/function="putative deoxynucleotide monophosphate kinase"
/codon_start=1
/transl_table=11
/product="gp52"
/protein_id="CAA07122.1"
/db_xref="GI:3947471"
/db_xref="SPTREMBL:Q9ZX97"
/translation="MAYYKSIGLIGRAQSGKDSVGARLRQRYGYQRVAFADPLKAAAL
RIDPLIPTSYGVQTRIATLVNAVGWDYAKVTYPEVRRILQHVGQTVRDIDPGFWVRAA
FPAIDAAERLSLPVWTDVRYENEARALRDRGFSMVRVTRPGTLKPDAHKSETELDNW
ATALTISNTGTLEDLNRIVDSLLLPRSR"
gene 19274..21325
/gene="c"
CDS 19274..21325
/gene="c"
/function="repressor of lytic genes"
/codon_start=1
/transl_table=11
/evidence=experimental
/product="repressor"
/protein_id="CAA07123.1"
/db_xref="GI:3947472"
/translation="MKRVTLGGGKAVHYSTTPDGFMA5PACGGNRASERYVPTDADVT

CKRCAKILAAEAEREERLNRDPRGDEWMGRTIGDAVTVTLHGRTFDTELTGADHITPG 

WTVAYVT>EDGQRNGTF\nrVTDADIQDGDKVSDPRKDAFDKARALGMDWAEALDY^ 

TAEMAQPTHVSSVESATHDNDDNKGTGTMATKKLKLKDVRGDVRIGAVPGADAIHALR 

NAVDENGRNLPMCRTRTKNPIQYWGPAAEQKPELELCAGCSKWPTGEVSVSEESVEV 

PGLSMI^SQKSYTPVEGDDKGENMAAKNDTQDVDAQISAVHGHVDNIKTAETVEAVKE 

AAEAAEGIITTLPTKHRNTLRSTVKEARTARETELTPVTPEAEAAKAEVESRRSADVA 

EDFNDIEGVPDLIKDGVKLFSQGVDLGLKLTNAGEKLAHVMLTMRQKIVNPATGLPDL 

TAERKTTKNAAAEVYAQTOCKRIADDDVERQGAHNSLVRATQNKASDVLVDCVRAFDGP 

DRKESLAVTVSELFGDKLDGLKDDASISEAIYRLYAGQGIELPRYGRTELARYDRRVKA 

lEGATKELETLTDGDKDANPKDVEALEEKIKELKAEVPEEILTEKLEPKAEKSDAEKT 

ADALKVIRAQVDKAGKRFAKVKTANEKRKAKAELYSIIRAAADAFDLDLSALVTADED 
E" 

gene 21396..21569
/gene="54"
CDS 21396..21569
/gene="54"
/codon_start=1
/transl_table=11
/product="gp54"
/protein_id="CAA07124.1"
/db_xref="GI:3947473"
/db_xref="SPTREMBL:Q9ZX96"
/translation="MPGYMWESLFADGKTHAVLKDSATPQTTECGTAAGFPVGGSTP
PTCTPCAEAVRDA"

gene 21820..22665
/gene="1"
CDS 21820..22665
/gene="1"
/codon_start=1
/transl_table=11
/product="gp1"
/protein_id="CAA07125.1"
/db_xref="GI:3947474"
/db_xref="SPTREMBL:Q38022"
JlAATIAVDSIDFVTyDMAARDREGAIQYISd^lPRYTKV^^
GEYVGRVRADLTPYVEHFREFLKAVNPELVRAEDVAWSDTYGYAGSFDWMRVWLDAD
GNPTPDRSGKPHLIMGDWKTSKATYPDVALQMSAYMNADFIIDPDGNREPMPEFDGTV
VLHVTDETWAFKPVETGPDVFAQFLHLRQTFDWDRDGSRKVIGKPVARKASGRMVTGT

QRRAR" 

gene 22722..22943
/gene="2"
CDS 22722..22943
/gene="2"
/codon_start=1
/transl_table=11
/product="gp2"
/protein_id="CAA07126.1"
/db_xref="GI:3947475"
/db_xref="SPTREMBL:Q38023"
/translation="MRRSEVVEFLAFVGWFVAAWGIALSAF\A/MILIGWWHGHNAA
VPALGFIDCLYAVGLTSLLALIVTPVTRD"
gene 23098..23832
/gene="3"
CDS 23098..23832
/gene="3"
/codon_start=1
/transl_table=11
/product="gp3"
/protein_id="CAA07127.1"
/db_xref="GI:3947476"
/translation="MAKRSIWAGDEDNKPKKRETYADDTVGRFHSGYSETNERGKWP
VALDKWRISTGEQSVADAVAQLFGGTPVENEESTSENFIDVFTDRPKSPWIIEADGIH
WDMKLWLNGKLKHHCDGFDFVSHADEEMIGQPCGCPKLFDERKAAAKEYDAPNPAITV
TFTIADDPELGRFKFQTGSWTLFKVLHEAEDDVERVGKGGAVLANLELELVEYTPKRG

PMRNKLVSYYKPTI TVLKS YNDAIAjy' 
gene 23832..24014
/gene="4a"
CDS 23832..24014
/gene="4a"
/codon_start=1
/transl_table=11
/product="gp4a"
/protein_id="CAA07128.1"
/db_xref="GI:3947477"
/db_xref="SPTREMBL:Q9ZX95"
/translation="MAALDPASPEAWAHTMRTASDDAVKAPLWQYPPEARRAVLSERA

RRFGI PTADDFDPE Yir' 
gene 24014..24304
/gene="4"
CDS 24014..24304
/gene="4"
/codon_start=1
/transl_table=11
/product="gp4"
/protein_id="CAA07129.1"
/db_xref="GI:3947478"
/db_xref="SPTREMBL:Q38025"
/translation="MGKRGTVTDYAGEALYVGDLINYATRCGNRARASDGIIRKIEIR
TAYGKLVPFLQVQPTGVDSGYGLGERKTLRKEWITTEHARLLRSNVTGEQNG"
gene 24388..24573
/gene="5"
CDS 24388..24573
/gene="5"
/codon_start=1
/transl_table=11
/product="gp5"
/protein_id="CAA07130.1"
/db_xref="GI:3947479"
/db_xref="SPTREMBL:Q38026"
/translation="MHTGLFIGPDAAPALGDVRALKPGDTVYLKPGATERKDWGRYLD
ALSNAITRGASWWWGE"
gene 24570..24695
/gene="6"
CDS 24570..24695
/gene="6"
/codon_start=1
/transl_table=11
/product="gp6"
/protein_id="CAA07131.1"
/db_xref="GI:3947480"
/db_xref="SPTREMBL:Q38027"
/translation="MSSEVFNAFVGGVMVGICVGVLAIAITAMVCTUiRERVRN^^^
gene 24685..25050
/gene="7"
CDS 24685..25050
/gene="7"
/codon_start=1
/transl_table=11
/product="gp7"
/protein_id="CAA07132.1"
/db_xref="GI:3947481"
/db_xref="SPTREMBL:Q38028"
/translation="MAHESKCPCQPCRNKRRKAYIKDYYRKLPRDKRHTLSQKRRATA
YGVEHEEYSRTEIMRRWGYRCAYCDAKATHLDHVHPLSKGGADAAHNMLPACAKCNLS
KGAKTLAEWALTFGPKPAD"
gene 25107..25367
/gene="8"
CDS 25107..25367
/gene="8"
/codon_start=1
/transl_table=11
/product="gp8"
/protein_id="CAA07133.1"
/db_xref="GI:3947482"
/db_xref="SPTREMBL:Q38029"
/translation="MDFASILGRFKAVSEEPDGGYLALCPAHSDSRPSLRIWRGDDLK
VRLTCRAGCDTGDWSSVGLKWSDLFNASGEGLTVPKRSRRW"
gene 25361..27523
/gene="9"
CDS 25361..27523
/gene="9"
/codon_start=1
/transl_table=11
/product="gp9a"
/protein_id="CAA07134.1"
/db_xref="GI:3947483"

/translation="MVSGAPVTRLRMWLESLPLTQDAADYAADRFGLDVAQAEALGLR 
YSPDGQGYDWPDFVSTSFARFPRMWPLKGFDGVTRGAQGRDLSGKCPGRWLSLKNPD 
GQRWAPYGVFRGDAGYGWLITEGPGDALTAVSVGYDAVAVRGASLVNNPELVAELAE 
GLKGFQVIVCGDNDTAGVGFTLRLSEGLAGHGIDAYALNVPVPGDDLTDWRERDPGKF 
PSRLHDAVKSARPVRDRAOVEAEHRKAEVAHRTGAVQVSSTQGADAARILGDLVSTYG 
ESDAMNAHALVAWTDGRIKYASGLGYFVWDGVTWVKSATRVRQEIHAMGAALVLAGCL 
PESRGFTMTTRIDALWTELRSVPSVHVEAEEFDANAHLLSFANGWDLRTGKLRAHDK 
GDMLTVSLPIEYDPNAQAPRWEQFLQEIFPNNADLVGYMRRLVGYGITGNTSEQCFAV 
LWGKGANGKSVFTETLTDVFGRITKTTPFATFEDKGNGGGIPNDLAALRGSRLVMASE 
GESGKPMSEAVLKRVTGKDKVTARFLRQEFFTFAPTFLIMLATNHKPKFKSQDEGLWR 
RVKLIPFVRYFAPEERDYDLDRKLRAESAGIVAWAVRGAVEWYANGLGDPESISTATR 
EYRATSDALAGFFPGVLDAADDSAIVSGADAYNSYRDWCEAEGLKSTEVWSRKAFYGA 
MEERGIGKKKTNTGIALVGVKF7U)APAAATGPGIFGKD" 
gene 27632..29482
/gene="11"
CDS 27632..29482
/gene="11"
/function="DNA polymerase"
/codon_start=1
/transl_table=11
/product="gp11"
/protein_id="CAA07135.1"
/db_xref="GI:3947484"

/translation="MEYHHDVRGDVITVFIPETERDLREFMHWARNKPELALDTETTG 
LAMYSSGYKLRTVQFGTAHEAWVIHYELGGRFAEAADYVLKHCPRFLIHNAPFDWLVL 
DAHTPVSMESLAPRTVDTKIKATLIDPRQPQDGGIGTGLKPLSAFYVDPSAPDTQGDL 
TAVFRSLGLTKETGWAGIDLRHPTYNLYAGLDVIYTARLNPCLDMHARLSIRPTLLE 
YEHEIAYMCAYMQRAGIJ^DLEYVDTLRRMLREEEEKHLYAAAMWGVDSVNSGAQVAE 
ALIJ^GETLTQRTDGGALKVDKAVLLPIAD LDRDWE RI GT^E PNPIJ^VLRAKRAGK 
WVTTYADRFANNHDPHGRIHPNINTLQARTGRMSINGDFAAQTLPSSDWMIRRAIVGD 
APDHIMGSVDFQAIEMRVIAAIJUDVKRMKDGFVTJGGSDFDIHMyTAQLIKGLEATKRD 
RKVFKGAGFGKVYGGGVATIARQTGATEAEIARAVAEYDRVFPEIKRASSRWQREARG 
TGLVTVSVTGRRLPLDR^mTyAVVNYQCQSAARDVLGQAMLNMRDAGLLDY^IKLPIHD 
E IVFSAPKADAKDIARE FEKCMTMDLFGVPWADADLGGRSWGSLYGADV" 

gene 29957..31147
/gene="12"
CDS 29957..31147
/gene="12"
/codon_start=1
/transl_table=11
/product="gp12"
/protein_id="CAA07136.1"
/db_xref="GI:3947485"
/db_xref="SPTREMBL:Q38033"

/ 1 r ans 1 a t ion==" MLTIETIRAAQSADDLADRIjaj^REVIEATDSRWAIJU^K?^ 

MAPHGGARFADWADEFTQVGRVAVWDCLKRFTDTTVDAFERYVYATVDGTLKDAVREE 

RNGNAGADENAVKVYASMLEAAEGDVYEAARLAQIIPPKGKRLSKERAEAARLAWQGA 

VSLDKVTSAENADTUDGSLADTLKHYDEEPDGEIRPKVGRGALIEAAYVLERYVSVPRD 

AEARTCVLDALELATQGETTPADVTU^LEDVLTVPSDPTERRYVLDALAVLHAAVSTST 

EGEVADDLRDVRDDRMADSREKHARVNDCIESMGATQRDILKHSFGIGGVTDYGHGDG 

CDMEGMCAQVGVTYVQLKSYRPKARKAFAKRYVAAVKLTGAEAIAAVLEAAAAERLTN 
AGRK" 

gene 31241..31408
/gene="13"
CDS 31241..31408
/gene="13"
/codon_start=1
/transl_table=11
/product="gp13"
/protein_id="CAA07137.1"
/db_xref="GI:3947486"
/db_xref="SPTREMBL:Q38034"
/translation="MQTFTLPTGHTVTTQRVGANVEFVTANADGDVISTVQHSFAESV
PLIKRLACRTR"

gene 31550..31945
/gene="14"
CDS 31550..31945
/gene="14"
/codon_start=1
/transl_table=11
/product="gp14"
/protein_id="CAA07138.1"
/db_xref="GI:3947487"
/db_xref="SPTREMBL:Q9ZX94"
/translation="MWRVGHGTALDSGFPSGPYTCEGVEDEDAVTCRVWGMASEHSNST
HPSPFADDALYDIASFERCGFDSRDALNKWFDGWTDALDASGFRVWEYDVPDWAARVG
KFGQWFSSFEAVEVASYGFEPEQLSLFK"
gene 32026..32346
/gene="15"
CDS 32026..32346
/gene="15"
/codon_start=1
/transl_table=11
/product="gp15"
/protein_id="CAA07139.1"
/db_xref="GI:3947488"
/db_xref="SPTREMBL:Q9ZX93"

/translation=" MRVTTETKAELQAERGRVVGEVKSSEPGKISIAVPADRAVMTPA 

QARQIAAWLNEEADRGGRVSTDARTAEGWRTAEAERQRAIRETLLhA/TDRVTTPSGRTFT 
RRAY" 

gene 32587..33300
/gene="16"
CDS 32587..33300
/gene="16"
/codon_start=1
/transl_table=11
/product="gp16"
/protein_id="CAA07140.1"
/db_xref="GI:3947489"
/db_xref="SPTREMBL:Q9ZX92"

/translation="] 

GR I C YKS FE RF^ PATASNPGYLGNI LAQGHFSVLEHAS VT FLVRDVSRALLTE LSRHR 
HLSFSWSQRYVDHADTEPWPPAIRGTELEKPFREDYAEALQAYDAGVKLLRARGYG 

RKQAREAARALLPNAAPVDMWTGNLRAWRDVLGKRWHVAADAEIREFAGRVLDHLHA 

VAPNSVQDMPTS PFGSDGK" 
gene 33297..33566
/gene="17"
CDS 33297..33566
/gene="17"
/codon_start=1
/transl_table=11
/product="gp17"
/protein_id="CAA07141.1"
/db_xref="GI:3947490"
/db_xref="SPTREMBL:Q9ZX91"

/translation="MSCTECKRATGHKLDCGQREPNPFLQLDA7UO:DVIDDMVDEWLD 
RDRHGE LNGGYG YGPKKDRALSRIHEQI KEAWRAKVRYADPEETE" 
gene 33563..33757
/gene="18"
CDS 33563..33757
/gene="18"
/codon_start=1
/transl_table=11
/product="gp18"
/protein_id="CAA07142.1"
/db_xref="GI:3947491"
/db_xref="SPTREMBL:Q9ZX90"
/translation="MKRVAAAIAGVALVGAVTVGCDPGPECIESHSEMTWVPMYNGKT
TTLQPVWTTVCTKYETETPK"
gene 33769..34020
/gene="19"
CDS 33769..34020
/gene="19"
/codon_start=1
/transl_table=11
/product="gp19"
/protein_id="CAA07143.1"
/db_xref="GI:3947492"
/db_xref="SPTREMBL:Q9ZX89"
/translation="MIDLPSSGRSTVKAFREACWALDVEPEFVDVTSLDCRMGVTRV
PTVRVYADDDPYGDVLAEHRGKATGEEITALLNRGLALV"
gene 34064..34417
/gene="20"
CDS 34064..34417
/gene="20"
/function="putative dCMP deaminase"
/codon_start=1
/transl_table=11
/product="gp20"
/protein_id="CAA07144.1"
/db_xref="GI:3947493"
/db_xref="SPTREMBL:Q9ZX88"
/translation="MATRADCTRSQVGAVLVNANHEVRGTGYNGAPSGVPGCASAGAC
PRGQLSAVECAPNSDYANCVADHAERNAIRHAPSAEIAGATLYTTREPCPACWTLIRA
AGIRRWTPTTSHTF"
gene 34491..34763
/gene="21"
CDS 34491..34763
/gene="21"
/codon_start=1
/transl_table=11
/product="gp21"
/protein_id="CAA07145.1"
/db_xref="GI:3947494"
/db_xref="SPTREMBL:Q9ZX87"
/translation="MIANFWEDASAVIRRAPI£DESAIJ>1WIAGPTLPDGSAGEMVSMT
LETARKLRDRLNAQLAGFEPTPTEPRCTRHGSECDRDPKKAHIFKR"
gene 35027..35383
/gene="22"
CDS 35027..35383
/gene="22"
/codon_start=1
/transl_table=11
/product="gp22"
/protein_id="CAA07146.1"
/db_xref="GI:3947495"
/db_xref="SPTREMBL:Q9ZX86"
/translation="MPHSVPFDLSLSVWVPSGAARAWLPCSVLGGALTDELIQSCDEL
KAVFKAHGKLVARLLSSPSAPRYDGFRIIGRRKDTGAMVAAVEWVRSRETRELVRGSV
IWTACKYVHPATALVA"
gene 35468..35830
/gene="23"
CDS 35468..35830
/gene="23"
/codon_start=1
/transl_table=11
/product="gp23"
/protein_id="CAA07147.1"
/db_xref="GI:3947496"
/db_xref="SPTREMBL:Q9ZX85"
/translation="MPNKITFGASVLASAAT7^GLGALAFrSPNAPETAYPLPTHSAP
DVEPETAPLSVGATEDPGTVPTIAPVACTAPSSTPKTGRHSKPRTEPETDDTATLPRH
AKPLATPSPSSTTPGRAA"
gene 35827..36216
/gene="24"
CDS 35827..36216
/gene="24"
/codon_start=1
/transl_table=11
/product="gp24"
/protein_id="CAA07148.1"
/db_xref="GI:3947497"
/db_xref="SPTREMBL:Q9ZX84"



n™^^^^ ^^^^^^'^^^^^^^A^^SRTLPSSSASVSGPFEPAG 

AETPLIAEGGSPEYPYEEAWGDTPDTAADPGWCDKELAEALGVPYSPTCYTGHSPAD 
TWAREEAEADDQPAGPVWDLSGGNGWr- v t-i CXTGHS PAD 
gene 36289..37005
/gene="25"
CDS 36289..37005
/gene="25"
/codon_start=1
/transl_table=11
/product="gp25"
/protein_id="CAA07149.1"
/db_xref="GI:3947498"
/db_xref="SPTREMBL:Q9ZX83"

/ 1 r ans 1 a t i on=" MGTKAIVRTRRVLTGGRWFLILGLVFY SIJ4TTTPFVSAHSEWAW 

SGV7VLGLIVDAAFIM7U,SAEGTLAKYGVTKLGAWPVAFRWITGLSSVFLNVWLSVSAH 

DWVGVAVHLIAPALVMLLAEVGPVYMKALADAEREALS APE PVAE PE PVPAE PE PVTE 

PETAPQAVQDVLPE PTPE PE PE PADDDVPARLPNAQANKI I EEGWRHRLN PVE VAAAA 

GRH PATVRKKFAQLDAE LSV" 
gene 37166..37636
/gene="26"
CDS 37166..37636
/gene="26"
/codon_start=1
/transl_table=11
/product="gp26"
/protein_id="CAA07150.1"
/db_xref="GI:3947499"
/translation="MGPKTQAAYVIISGWIATGRYGPGDKLPSERAMCEDLGIGRTAL
RQVLAKLVAEGILEVHQRSAYRVPSGMRISWVIEDHGANEATAPTIDSAAEALTAGVR
SAYADEDTTTLAHILFNWGPLRMKLVTDGRFEVERGRTWEAREGGIFVTLSPN"
gene 37652..37954
/gene="27"
CDS 37652..37954
/gene="27"
/codon_start=1
/transl_table=11
/product="gp27"
/protein_id="CAA07151.1"
/db_xref="GI:3947500"

/translation="MFPMRRYRVTQRPPEWRQELQRTEDGNPPVVRPWVlFDGAMKGY 

^^^f^^J^PSTmPLEWVTQHGAQAWIMRCYRTWGEVPLVGGGAVPYNVARERVGR" 
gene 37951..38277
/gene="28"
CDS 37951..38277
/gene="28"
/codon_start=1
/transl_table=11
/product="gp28"
/protein_id="CAA07152.1"
/db_xref="GI:3947501"
/db_xref="SPTREMBL:Q9ZX82"

/translation="MKRYRVMERALQRRKNGTWVVRKNPPFVIFDEVMGDYCALPODD 

DGEPVTLEWRSRSAAYDWIJUlCLQTWQmERTGRAADVPKAWRGFPVPEQSPWVGY^^ 
PLYGPY" 

gene 38445..40262
/gene="int"
CDS 38445..40262
/gene="int"
/function="site-specific recombinase"
/codon_start=1
/transl_table=11
/evidence=experimental
/product="integrase"
/protein_id="CAA07153.1"
/db_xref="GI:3947502"

GGRFRFVGHFSEAPGTSAFGTAERPEFERILNECRAGRLNMIIVYDVSRFSRLKVMDA 
VSELIALGfVTIVSTQEGVPRQGNVMDLIHLli^DASHKESSLKSAKILDTKNLQ
GYVGGKAPyGFELVSETKEITRNGRM\m\ry^fcu^HSTTPLTGPFEFEPDVIRW
ikthkhlpfkpgsqaaihpgsitglckrmI^Pf/ptrgetigkktassawdpat-
VMRILRDPRIAGFAAEVIYKKKPDGTPTTKTEGYRIQRDPITLRPVELDCGPIIEPAE
WYELQAWLDGRGRGKGLSRGQAILSAMDKLYCECGAVMTSKRGEESIKDSYRCRRRKV
vdpsapgqhegtcwsmaaldkfvaerif^^^irhaegdeetiallweaarrf
PEKSGERANLVAERADALNALEELYEDRAAGAYDGPVGRKHFRKQQAALTLRQQGME
RIAELEAAEAPKLPLDQWFPEDADADPTGPKSWWGRASVDDKRVFVGLFVDKIVVTKS
TTGRGQGTPIEKRASITWAKPPTDDDEDDAQDGTEDVAA"
tRNA 40706..40781
/gene="tRNA-Thr"
/note="codon recognized: ACff'
/product="tRNA-Thr"
/anticodon=(pos:40735..40738,aa:Thr)
gene 40706..40781
/gene="tRNA-Thr"
gene 40777..41013
/gene="29"
CDS 40777..41013
/gene="29"
/codon_start=1
/transl_table=11
/product="gp29"
/protein_id="CAA07154.1"
/db_xref="GI:3947503"
/db_xref="SPTREMBL:Q37833"
/translation="MSGYTIAWLAWLAAFGVIEGRALFNKKPGDTLSEHVWSWFATQS
GSTGKPSGWVRARRFAIiLAFMGWLTAHFMTGGRF"
gene 41097..41444
/gene="30"
CDS 41097..41444
/gene="30"
/codon_start=1
/transl_table=11
/product="gp30"
/protein_id="CAA07155.1"
/db_xref="GI:3947504"
/db_xref="SPTREMBL:Q37834"
/translation="MRTRCLDCRDWATHGGRCAQHHATYQAQRSVKSHAKRRAAIARG
NNAAAKMRRAIRKAVGAHCATCLGWYLPSQLDVDHIKPLALGGEDVEGNVQALCKRCH
KTKTAMDFGKRPF"
BASE COUNT 7629 a 12659 c 13732 g 7467 t 2 others
ORIGIN
1 cccggcccca gctcggaaaa cgcgcgctag gtgagccgcc ggacccaggc gccccccagg
61 ccaacgaccc gtgtgaacgc tgggcaagcg gcacttcggc acgctgagaa tgggtcagtt
121 aacgacccgt taagggggtg ctaagtgagt cgagcgaaga gcccggagct tcggaccggg
181 aacgccaacg ctgccgctga gtccgctgcc cccgtcgtgt acgagggtcg agcgccccgt
241 gtgcccgcgc acctgaaggc gaccggcaag gacgtatggc ggaacgtgtg gcaagccggt
301 atgggcgcct actcccccga cacggaccgg aacgtgatct tgcgttactg cgagcttcac
361 gaccgtcgcg ccgacctgtt gtcgctcatc gaagccgacg ggtacatgtc tgagggttac
421 aacggtcagc cggtcgcgca cccgatgctt cgctatgtcg agtcgactga gaaagagctt
481 cggtcgatag agacggcaat cggcttcacg cctgaagcgc gtatgcgcct gggcatcgtt
541 gccgctgaag cccggaaggt tgccgccggt cctgaagact tctgagccaa cgaaggaagc
601 cccagggttt cccccggggc atcttccctt ccctgccgct tagcgcttgc tgaacgcggc
661 gtacacggcg cgggtggcct cagcctcagc gcgcttcagc gccttcgggt cccggtgaac
721 cttgaccgtc tcgcccttgt tggcgtaggc gccgaaggtg aagccgacga gccggtaacc
781 gtacagaccc cgtgcccagt cgcgcgccca ggcgccggag cactcgaccc attcggacac
841 ctggtattcg tcgctgacgc gctgaccctt ccggttgtaa gccgccttca tggtgtacgt
901 gtgccagacg tagtcacctt cccgaaggcc ctcaaccggc ttgtcgactc cggcggcttc
961 gtcgcgcttc atgcggtcgc gcttctcggc gggagtctcg gcgggggcga cgacccggaa
1021 ggcttcccag caagcggcgt cggcaacggc catgtcgagc gtgtagtaac ccggcgtgcg
1081 gtaaagctct tcgccccgga gcacaacgcg gacgtagtga aggccgttcg tcttgccgaa
1141 gtaccgggtc ggggtggcgt tggcgttcat cttcggggct ccgtttcgtt cgcttcgttc
1201 ttacagaacg aagcctagca tgcctactca cgtgagtagg cagcttccac aaactttctt
1261 gagatttttt cggggg^^c cgtggcggac tgggccggaa tcgaccccgt gattgcgcgg
1321 cacattcccg ctgaco^^ ggttcccagt gagggttacc gggtcg|flfe| gtggatcgaa 
1381 g^gttttgct acctgj^H^ ttcgttcgcc ggtcagccgt tccggi^^l tccgtggcag 
1441 cgcgaacttc tcattg3l§c gtacgtgttg actcaggaca ccttcggccg ttggcgccgg 
1501 aagcatcgga ccgttgttgt gtgcgtggcg cgcaagaacg ggaagagcac cattgccgcc 
1561 gcgatcatgc tttaccacct gatagccgac cggggcgacg ctcagcgaca gatcatcgct 
1621 gccgccaacg accgcaatca ggcgcgcatg gtcttcgact ccgcgaagca aatggtgaac 
1681 gcgagcccga agcttgccgc cgtgtgcgac gttcagcgcg acgtgatccg gtacaaggac 
1741 aacacttacc gggtcgtgtc ggcggacgcc ggacggcaac agggcttgaa ccctgccgct 
1801 gtgtcgctcg atgagtacgc gttcagcaag cacagcgacc tattcgacgc gctgacgctg 
1861 ggcagtgccg cccgtaatca gcctatgttc ttgatcatct cgacggccgg acccgacccc 
1921 gacggaccct ttgccgcact gtgcgagcaa ggtgagcggg tcaactccgg tgaggctgac 
1981 gacccgacgc ttttctatcg gtcctggggg ccgaagctgg gtgagacggt cgaccacctt 
2041 gacccggacg tgtggcgcgc gtgtaacccg tcgtacgaca ttctcaaccc ggacgacttc 
2101 aaggcagcgg cacagcgaag cactgaagct agcttccgta tctaccggct gagtcagttc 
2161 gtgcgcggtg cgtcgacatg gttgccgcat ggtctttggg attcgcttgc tgccgacgac 
2221 gacccgcttg agccggggga cgaagtcgta ttgggcttcg atggttcttg gaagggcgac 
2281 agtacggcgc ttgtggcttg ccgcattcgc gacctgaagg tgttcgttct ggggcactgg 
2341 gaagctccgg cggatgacgc ccattggcgc gtgcctatgg cggacgtccg cgaagagcta 
2401 cacacggcgc tcgacgtgta ccgggtgcgg aaccttgtcg ctgacccgta ccgctgggaa 
2461 gagacgctag acaatctcga agccgacggc ttcccggttg aggcgttccc gactaactct 
2521 ctcgcgcgca tggtgcctgc cactcaggcc gtgtacgacg cgtgccgtga cggtcgactg
2581 agccacgacg gcaacccggc gttgggtcgg cacatcggta acgcggtact gaaggaagac
2641 gcccggggcg cgcgcatcac gaaggaacac gcgtcgtcgc gccggaagat cgaccttgcc 
2701 gttgcaatgg tgctcgccgt tcacggcgct gtgatgtggc gtgaagacaa cggcatcgtg 
2761 agcgacaagc caattatcgc gacctgggaa gacgacgaag gcaacgtctt cgtgcacccg 
2821 gatcacgcgg agttcttctg aatccaacct actcacgtga gtaggtaaga cccgaagggg 
2881 gcacggtggg tttttggtct gcactcttcg ggcgggggca ctccccagcg ctcgacggca 
2941 ttgaggcgcg agcctgggaa ccgtacgacc caagcattta caacctgggc gccgttgcgg
3001 cttccggcga gacggtcact ccgcatgacg cgcttcaggt gtcggcagtg ttcgcgtcgg 
3061 tccggcttct gtcggagacg attgccacgc tgccgctgag cacgtacagc aagcggggcg 
3121 gctcgcgtaa ggagatagtg acgccggaat ggctcgacta tccgaacgct gagccgggcg 
3181 gtatgggccg gatcgacatt ctgtctcaga ccgtcctcag cttgcttctt caggggaacg 
3241 cgttccttgc cgtgcgctgg cagggtccga acatcgttgg ccttgacgtg ctcgacccga
3301 cgaagattca cgttcacatg gtcatggtcg acggtcttcg ccggaaggtg tttgaggcgt 
3361 acgacattga cgccgacggg aacgaagtcc tgttgggttg gttcacgccg cgcgacgtcc
IaI. '^^^^^^^ttcc cggaatgatg ctgccgggtg acttcgtcgg gtgctcgcct atctcgtatg 
3481 cgcgtgagtc catcgggctc gcccttgccg ctcagaagta cggcagcaag ttctttgcca 
3541 atggcgccat gccgggcgct gtggttgagg taccgggcac gatgagcgaa gagggtttgg 
3601 cccgtgcgcg cgaagcgtgg cgtgccgcta actccggcgt cgacaacgcg caccgcgtag 
ggagggtgcg aagttttcga aggtggctat gagccccgac gaagcccagt 
^?fl? ccgtcagttt caggttccgg aaatcgcgcg aatctttggc gtgccgccgc 

3781 ^cctgatttc ggacgctacc aactcgacgt catggggcag cgggcttgct gaacagaaca
3841 Inftr^l^'t^ catgttcagt cttcgcccgt ggcttgagcg catcgaagcc gggttcaatc
3901 ggcttctgtt cgccgagacg gccgaccgct tcaggttcgt gaagttcaac cttgatgaga 
lltl lllt^t^ttf gaacgtatgg agctttggag cctgggtctt cagaacggca 

I^ni tgacgaagtg cgcgccgctg aagacatgac gcccctgccc gacggattgg 

4081 gcgagaagta ccgggtgccg ctgaacctgg gcgaagtcgg cgaagagccg gagcctgagc 
4141 ctgcccccgc tcccccagcc attgagcctc cggcggaaga gccggacgaa gagccggagc 
4201 cggaaggcaa gccggacgac gaaggggcaa ctgaagaaga tgac|aagac ga?gcg?g2g 
tlti ^<=f ^99=^^ ccttgaagag cgcgcgtcgg atgacgggcg catttctatg 

438? Jllttlt-tt ^^'^^^^^^^ caacgaactg agtcacgacc tgggcggctt ccgggaacgc 
4381 atcgttcctg gggcaggggc gccgtcgctg cgacagaacg acgtgtacgc cacgttcaac 
4441 cacgacacgc gttccctact ggggcgcacg tcttccggca cgctgcgagt cggcgaagac 
4501 cgtgaaggcg gatggtatga gatcgacctt cccgacacga ccgttggtcg cgacgtcgct 
4561 aagctactga agcggggcga ccttcagggt tcgtccttca ccttcc^cg? g^tcgacggt 
4621 ggg^^gcgac gggcggacga cgacgaccct gagacggggc ttcccgttcg ggagatcacg
TlA tggtcgagct ggggccggtt gtgaatccgg cgtacccaac IfcLaggc? 

aIoi llTnllt^ cgattgaaga gacgcttcgt atcggtgagt tcgcgccgcc gactgaagag 
^^^^attccc agccggacgg cgacgttgcc ccggcttctc atttcgacgc gcgtLcct? 
till llltr^''^'' tttctaagta aggagtgtcc atatggacgc gaccactc?g fglgccaact 
till llaltttrj ^^^r^^^^^ accgctgagc ttcggtccct gacggatgag ttcgccggta 
snJJ ffgagatgac cgctgaggcg cgcgagaagg aagagcgtct tctcactgcg gtcgccgact 
5041 ttgacggccg gatcaagcgc ggtatcgacg ccattaaggc gacggacgct gtgacgtctc 
5101 ttctgtccgg^Pttcagggt tcgggctccg gcgctcagcg ttccgccgac cacgacgacg
5161 acgccgttct^^tgcgggc aacctgggtg aggcacggtc ^^rgagttc gcccctgaga 
5221 agcgcgacgg^^^gaaggcc ggtaacccga acgttctgag ^(^accctt tacggtcagc 
5281 tcatcgctca ggctgtcgag cgttccgcga tcatgcgcgg tggcgcgtcg accttcacga 
5341 cgtccgacgc gaacccgatg gacttcacgg tgatcacggg tcgggcgacc gccgggattg 
5401 tcggcgagac ggccgagatt cctgagagct acccggccac gactcagcgt tccatgggcg 
5461 gcttcaagta cggcttcgct tctgtcgtgt cgtatgagtt cgccactgat caggttcttg 
5521 accttgtcgg cttccttgtc tccgacgccg gtccggccat cggtgacgcc atgggtcgcc 
5581 acttcctgac gggtaccggc acgggtcagc cgcgcggcat cctgacggac gcgaccggcg 
5641 ccaacgctgc cttcggtgag gctgacgccg actccaaggt ttccgacgcg ctgattgatc 
5701 tcttccatga ggtcccgtcg gcgtaccgca agaacgcgaa gttcgtcgtg aacgaccttc 
5761 gtgcggctca gatgcggaag ctgaaggacg cgaacggtca gtacctgtgg cagtccgctc 
5821 ttaccgtcgg cgccccggac accttcaacg gcaaggtcgt tgagacggac gacggcatgc 
5881 ctgccgacaa ggttctgttc gccgacctga gcaagtaccg ggttcgcttc gccggttcgc 
5941 tccgtgtcga ccgttcggtt gacgcgaagt tcagcactga ccagatcgtt taccgattcc 
6001 ttcagcgcgc cgacggtctt cttgtcgacg cccggggcgc gaaggttctg acggtcactc 
6061 ccgctgcctg atctgactag gtgaacgggc gctcagcgta ctcacgtgag taggttgggc 
6121 gcccttccct gaagggggca gcaatggctt acgccacgat tgaagagctt cgcgcgctcg 
6181 acggtcttga cgactccgcg cttttctccg atgaacttct gtccgacgca atcgacttca 
6241 gcgttgagac ggtagaggcg tactgcggtc ggaagtggga cacggccgaa gacccgacgc 
6301 cggaaacgat tcggtggtgt gtgcgcacac tcgcgcgaca atacgtgctc gaccacgtgt 
6361 cgcgcattcc cgatcgggcg cttcagcttc agtctgagtt cggcagcatt cagcttgccc 
6421 aggctggggg aaattggcgc ccgacgtcgc tgcctgaagt gaacgcgaag ctaaaccttt 
6481 accgcgtccg actcccgttc atcttcatgt gaggcactgt ggcgctcatc tttaacgcga
6541 aggtgcgact gttcgaagcc ctgaaggcca acgtgccggg tgacgttcag tgcaccttcg 
6601 cggagacggg agacaactcc cgtagaaaac aagtgtggtt gggcgcgacg gtagacgacg 
6661 accttgcccc cgtggctatg cgctccggcg cgaagccaac caacgtgacc ggctacgtag 
6721 aagcgcacgc cgttgtcacg acgccgggca atccggtcga cgctgagcgc gccgtgtatg 
6781 gcattcggga ttacgtgaag gccgcttgcg ctgccgtgaa cgccgacctt gtgtcggtcc 
6841 cggggctcat ggacgtacgg ccggagtcgg cttctgtcga gtccactgaa accactgacg 
6901 gcgcgtacag cgctctaacg gtccgtgtgc gggtgcgcgg tcgcgtctat cagtgacacc 
6961 cgcttagacg acgacctagg ggttttgcat ggcactcgac gcaagcattg gcatcgggcg 
7021 ggaagacacg tacggcactc tcagcggagt cgttgagggt tacgaggggc aggcggactc 
7081 ttggaagcaa acggtcgagc cgattgagtc tgtcggcttc cgggccggta tgcagactgc 
7141 ccgcgctgac cgccggaaca tggtcaacat gggtggcgaa ggcgaacttg agattgacgt 
7201 actcgacgcc ggagccggtt cgcttctgac ggcggcgttc gacaaggtca ccgttactga 
7261 cacgggcggc gtgaagacga ccgttcttga gacgtccgac gtgtctgagg ctccgtcctt 
7321 cacggctcag atggttcggc ccggggtcga cggctcgaag gtcgccttca agcacctggg 
7381 ttgcgtggcg accgaatgga gcctgacggc cgaagttgag gaagctgtga agcttgccgt 
7441 caccttcgac ttccgtgacg ccacgcacac aagcactccg gctgagattg tcgagccgac 
7501 ctatcccgct gaggcgtacc cgtacgactg gacgcgtacg gccgttgagc tttcccgtgc 
7561 gggcagcgtg gtcgagttcg acgcgacgtc gcttgagctg accggcgaac tgggcatgaa 
7621 gaccgaccgg cgttttctca atggttcgga gctgaagaag aagccggttc gtaacgcgct 
7681 gccgacgtac gaaggcacgc ttgagggtga gttcagcgcc gcttcgctgg gactgtacga 
7741 cgcgtttgtg tccggcgaag tgtgctcctt caaggcaacg ttcttcggca tcctgcccgg 
7801 ctcttcgctg agcgttgagg ctccggcgat tcagttcacg ggcgagtctc ccgaagcggc 
7861 gacggacgaa gtcacgattc acaatctgcc gttccgtgtg ctcgacccgg gcgacggcgt 
7921 gacgcccgca atcaagctca cgtacgtcga gcccggaacg ccggtcgagc cgtaatggca 
7981 cagcggagcg cgtacacgat tcaggttgat ggactccgtc agtttcagcg gaacgtgcgc 
8041 gcgctccgcg acaaggaatt gaacaaggcc gtgcgtgaag ccaacaaggc ttccggcgaa
8101 gttctgattc cccaggcgaa gcacgaaagc ccggacggtc accgcgaccc gaagtcgagc 
8161 aagcgttacc gtccgggcaa gctcgacaag tccattaagg tcacggcgtc cgcgaagggc 
8221 gccgtcatca aggcaggctc agcggctcgc gtgccgtatg ccgccgcaat tcacttcggt 
8281 taccggaagc gcaacatctc tgcgaaccgg ttcctttacc gcgccatggc ccgtaagtcg 
8341 gacgtcgtgg cggcaactta cgaacggcgt attgccgccg ttgtcgaaaa gtatttggag 
8401 agctaatgcc ccagcgcaag cctgccttcg agattcccga cgacttcacg cttgacctga 
8461 agcttgattc gctgaccatt gacgagatcg acgccattga agagatcacg ggtcagccgc 
8521 tcgactccct gaacaaggcc ggttcgcgcc gtgccccgat gcttcgcgcc atggcgtatg 
8581 tcgtcatgaa gcgcaagttc cctgagatcg agcccgccga cgtcgggaag ctgaagctga 
8641 acctgaaggg gaaggcgaag ccggacccta ccgcgaccaa cgcgtaattg cgtgcgcacg 
8701 tctggtcagt cacttcaggg ggttgacgtg gtcggacgtg cgcggcatgg agcttcgcga 
lot. ^^^^^^^^^^ ttggttgaac agatggcagc ggacatagag gctgagcgca gagacacgaa 
8821 gcgcgccggt cgggggcgaa gcgggggcac tgagcgccgc acgcccgtca tgacgtaagg 
8881 gggcgctgtg gctcgaccca ttcagatcac gattatgggc gacgccgacc aactttcaga 
8941 gacgcttgat caggcgWcg aagaagtgtc ggcgttcggc gaacaqgcaa aggggcttgc
9001 ccttgctgcg ggtggo^ tcgccgtcgg cattggcgcc ggtat J|k aggcgcttga 
9061 gcgggaagcc gggaa^p tccttgccgc ccagttgggc gcgac^i^ ctgaggctaa
9121 gcgcctgggt gaagccgccg gtgaagtgta ttcggccggt tacggcgaat ccgtcgccga 
9181 cgcgaacgaa gccctgaagg gtctttggca acaggggctc gttccggccg gagcgacggc 
9241 cgacgacaat ggcgaacatt tcgaaaaggc tatggacgtc gcgacggtcc tgggcgacga 
9301 agttgggccg acgtcgaatg ccgttgggca gatgctcaaa accggcatgg cgaagaacgc 
9361 cgacgaagcg tttgacattc tcgttcgtgg cgcccaggaa ggcgcgaata agagcgaaga 
9421 cctgttggac acgttcaacg aatacggcgt tcagttcaag ggaatcgggc tcgacggtaa 
9481 gacggcaatg gggcttctgt ctcagggtct tcagggtggc gctcgcgacg ctgaccttgt 
9541 cgctgactcc ctgaaggaat tcggtcttat cgttcgcgcg ggtggcgacg aagtgaacgc 
9601 ggcgtacaag tccatggggc tgaacggcgc cgagatgacg aaggccattg cccagggcgg 
9661 accggtcgcg aaggacgccc tagacaagac gcttgacggg ctccggaaga tcaaggaccc 
9721 ggcggagcgt atcgcgaccg ctgtgacgct cttcggcacc caggctgagg acatgcaaga 
9781 cgcgttgctg aagctcgacc cgtcgtcggc cgttgagacg ctgggcaagg tcgacggcgc 
9841 ggcgaagtcg gcaggcgaaa cgatgcacga caacgcggca acgaagatca aggcgttcac 
9901 gcgcggtctt cagactgggt tggtcgactt catcgggggc acggtccttc ccattcttga 
ino^J cccgcgcttg agggcatcgg gtcgactatg gcgacggtcg gcggcttcgt 

5'^*=59agcac agtacgacct tcaaggttgt cgctgggatt atcacggccg ttcttcttcc 
10081 ggcgctgatt cagtggggcg ttcagtcgac catcaacgcc ggtaaggcag tggtcgcctg 
lola. ^"J'^^f" agcgcgacgg cggtaatcga atccacgaag caagcccta? caLcgcgaa 
in;«i ^gtcgttgcc gggtggatag cgtcgggcgt tcaggccgga ctcaacgcgg cgaaggtggt 
10261 tgcgggttgg gtgcttatgg gcgctcagtc catgattcag ggcgcccqla tagcoacggc 
10381 t^^rft::':^ ^-ta^gggtc cgattccgct gatcatcgct gccattgtcg ggttggtcgt 
^nll^ tctcatcgtg gccaactggg acaagatttg ggcctatacc aaaaaggtct ttcaatggit 
10441 ttgggactgg gtcaagaaaa tcttcaattg gctggaggac ctgtttctga acttcaccgg 
loH\ tttt^fJ: ctgatcaagc attgggacaa gagccggtcg gcgacgaaga acaccttcaa 
lolt] aacttcgcga aggacgcact gaacgcggtc gtcaatttcg tgaaggggct 

JSIeJ lr.lT.T. ^^^"^^^^55 cagcgtcgtc gttgctcagc gccggtaagc gaatlggcgg 
Jn?5! gt^^f^att gacggaatca agaacggtct ttcgaagctg ggcggctttg cgtcgtcgct 
iSso^ tt<=^g^t gtggggcgtg ccgcgaaggg cgctatcaac ggcg?gatcg a?ct?ctgaa 
ioset IJtTr.^ ccgaacaagc tgggttgggg caagctcagt atcgaLtgc ccgacaa^cc 
,of,, r ^ ^ attcgcgcca tgggtggacc ggcgtccggg tggacccgtg ttggcgagcg 
lolll allllllTn" ^^^^^^"^^ ttccgaacgg ctcgacggtc cggccgaacc atgcg?tgt? 
IiIai tntl^^^^ ggcgtgacgg tcaacgtgca gacgaacgcc gacccgttcg cgattgggcg 
altltltlT ^^^^^^^^^f ggacgtctcc ggcgtgatcc ggttcctaci tactcacgtg 
^^taggttga aggaggttat gaagtggcgg agttgagcga ctggacatgc gagtatcggg 
Ji22i llll^ gggtctgccc gactccgcca tttcgattgt cggggtcgac ggcttgc??! 

JlPfti tgcccga cgttcggtcg tctgatctga cgttggttca gcgcLtggg ctttgggccg 
lllA Tc:allllllt ^^=^^^^^99^ cggacggtca cgctgacgct tgaggta?ac gggcgcgacc 
^^^^'^^^^tt cacggaagcc ctgaacgccc ttcaggcggc tttcatgccg ggcgttgatg 
^46? caJ^^r*"" =^9^"--^^ ttcccgggcg oagcgtccga ccgaacggc? ??cgtgltgg 
JJsIl "^^55cacg caagcggtcg gcgccgctcg acctgaactt tgcgtatctg acgiglaala 
mil caatcir^^ gctttacgcg acgtcgcctt acatcgtggg cgacgctgc? cgL^ggtga 
illJi tJll^trj =^^^"5^9C gacaaggtcc caactgggct tgtgcttlcg gLgt?gtgc 
lllOl tlllUlttl ggaccggcgc cggacgaccc ggtaagccgl ?tcactLgt 

JNei '^^^^^^^'^^^ ccttccatcg tgatcacgga tgcggcttcc ccgtggct?g 

]ilt^ ttgatgacgt gacgggcgcg ttcttcgcca ttgattacga cggcacggtt gtcattoatt 
1^881 TJtTT ^^<=^^^"=5 aacgccgaag gctccgacat tcgcggg^tt atcgctgacg 
Ilia] ^^^^^^^5^^ gcctgagtat ggcccgggtg atcaccggtt gcggctlcgg agtagalacg 
ilnn^ f^'^^^^^gg^ ggcttcggct tcgctgacgt ggtcggatag gtgggtttga tgag?t?ct? 
^2^6? cl^f'^ ' caggacggcg tcggctatgg cgcgtctcal ?t?g?cga?? ggJaggtga^ 
12061 cgctacggcg cgcggcggct tccggcacgt cttcaagacn acg^cggagt tLtgLCa 
^2^8^ aaTJ.^"^^ acggcgcgca cggttgcggt cggctccggc acggt??t?a tcgggggcac 
1225l tllllitttt ^^"^f catggtcgag cggagagacg gttgccattc cggSgcgtc 
12lo^ ccgcgtaagg acttgatcgt tgcgcggctg acgacgtcgg cfgcggacgg 

^2361 aa™o^^^ ctcgcgattg aggtcgtgca gggtacgcct gccgc?tcg? cgacggtccc 
^f^^^gg^^g gacaacgcgg cggcaatcgc cattgttgat gtgccgaagg cgtcgaccac 
,111^ g'^tcacgctg accgtgtgcc ggacgtcggg tcagtacacg ga?caggcgg c?tacgggaa 
lltl] <=g?9tcgctg tgtatcgact gggcggcagt gctgccgtc? ccgtcggcg? tccJgSJgq 
12601 atJ^a^ff^ tacgactccg gcaccaatca gacatgggtg cggct?ga?t ccgg?gat?g 
12601 gttcacgaag gacccgggtc cgtggaagaa gtgcactccg cagaacgtac agqiaaaaaa 
2721 lllTtlTall gtaaccgtca cgggtgatct gtacgttcgg gaftcgLgc tglgltHH 
12721 gttgtcgggt cagctcaact tctctccgag caaggatctt gaagttttgg tgtacg?ggc 
12781 gacgctgccg accggcatta cgcgcccgac tcagaacacc tatggggcgt cgggtcagac
12841 gtacggaagc cggcgg gtggcgtcgg ccgtatcgcg |^^^gtcgt cgggctcgat 

12901 tgagtacggg acggcg tgattgcgaa cctgtacgtg I^P^acagt tcagtaagtc 

12961 cccgtggaat tcgtgagggt tgccgactcc cttcgtgtag gtagggggta tgcccagtga 
13021 ccgaatacga agttctacag atcgaagcca agacgggtga cgttatcgcg acgttgcccg 
13081 tcacgggcat taagtacggc gagacgctga acgctgccgg tacggcgacg gtaggaatgc 
13141 cgctcgacgc tgccgacccc gacacgcttc agcctgggcg ttccggcctt gtggtgctcc 
13201 gcgacggcga acccgactgg ggcggcttgc tgtggaccac gacggccgac cttgccgccg 
13261 gaacgctcac gctgaacgcg tccgggtggc acagctatta cgccgggcgg gtgctccatg 
13321 acgggtacga gcgcaagacg gatcaagcgc ttcttctcgc cgattggtac gcgctgtgca 
13381 acgaagacgg cggcattggc acggacactt cgcggctgac gacgtcgggg cgtgtcgtgt 
13441 cccggttgtg gactcagtac gaattgaagg ttgtcgccga agccatatcg gagcttgccg
13501 aagatgacgg cggcttctac ttccgctatg agacgtattg gcggagcgtt acccaggtcg 
13561 gtaaccgggt gctgaagtac acgcccggtt cggcgtcgac tccgttcgcg ctgacgcatg 
13621 gtgtgaactg tgacgtgact caggtgtctt acgactcagc ggcaatggcg actcgcgcct 
13681 atgcggtcgg cgccgacaac ggcaacggca cgaagcttgt cggcatcgct gacaacgccc 
13741 tagacatgcc gacgaagcac gtcgtgcagt ccttcagcga cgtgaagtcg actgaatcac 
13801 ttatcagtaa ggcatacgcc atagcgacgg ccggagccgc acccgtggcc attccgacgc 
13861 tgaccctgta tccgggcgcc ttcaagccca gcgacttcgt tccgggagct tcgggcgtcg 
13921 ttcaggtcga ctcaggttat gtgcgcgtct ttgacgactt cgtgattacg gagcgctcga 
13981 cgtcgattga tgagaacggc acggagctta cgaacgttgg cgcttgccaa taaggaggtt 
14041 ttcaccagtg gcgattaacg cgaacgcgtt gagtccgtcg cttgtcacgg aactgaacga 
14101 cttgaagcgg cggcttgctg cggtcgagcg taagcctgac gtgctcgcga agttcgaccg 
14161 ttacccgccg gtcgagtggt cggctatcgg gcgcggcatg gtgtccggca acgtgtggtc 
14221 atcctgcggg ctcgcgaacg tgaccggcct tgtcttcgac cgggttgagg cgaagttcat 
14281 taccgaccgg ctcattaccg ggcgcagcga agccgaagtg cgcctagcgg ctttcaagca 
14341 tgatctagac gccggagcga aggtgtgtct ttcggcgtcc tcaacgatcg ctcttaccgg 
14401 cacgacgtcc cgcgcgctgg gcacggttta cgtccgttgg cttcatggaa tcccgttcgg 
14461 ctgggacgct gaggacggcg cggctatcta cacgattgag cttcagcacc gttaccggga 
14521 aggcccgacg ccggacgcgc acaatcacct tcaggtgtac ggcttcacga agcgcgagaa
14581 cgtcgctgcc cctgacggtg gcgctcttgg catgccgaac aacacgaact atgcgacggc 
14641 cgttctgtcg gagactcgac cgaataccgg catcggatgg actaccgtcc cggacccgaa 
14701 caacctgaat ccgtgggacg ggtcctacaa catctcgaac atgcattact gcgttggtct 
14761 gcccgaagac cgcattccta cggctagcac cggcggtcat tggcgttggc gcggcactaa 
14821 cggcatgaag gttcgggacg ccgacattac ggaagacttc aatagctcat gagccttacc 
14881 gatatcaccg gtcacgccga cgtaatcggc gcgcttctca tcggtgggtt tctcgtctat
14941 cagcgcgtgc gtaccggcac gggcaacgta tggcgcgacg aagccgaagc acagattgcg
15001 cgctcacagc gcttgtcgga cgacctaacg cttctcatca cggaagtgcg gaatctgcgc 
15061 gatgagaacg ccgggttgcg gctgaagtgg cggagctgcg gcgtgagaat cgtgagcttc 
15121 gcgaccacat tgacaacttg attgggggcg cccgtggcga ttccgaatga gattcctacc 
15181 gtacgggtca cgggtactta cctgggttgg gacggtcgag cgctgaaggg cacggttacc 
15241 ttcacgggtc cgggccttgt gacgttccct gagtcggact tgttcattgc cggtccggtc 
15301 gtctgcacgc tcgatgagac ggggcagatc attgacgcca acggcaacgt cggcgttcgt 
15361 cttcccgcta ccgactcccc cgacatgaac ccgtcggaat ggacgtacac ggttaaggag 
15421 aatctgaccg gcgtcgttgg cgcgcgcacc tattccatgg tgctgccgaa ggacacgctg 
15481 aacaactccg tcgaccttgc cgacgtcgcc cctgccgacc cgactacgcc gacgtacgtt 
15541 gccgtcccgg gtcccagcgc gtacgaagtc gctgttgcgg aaggcttcac cggcactgag 
15601 gctcagtggc tcgactccct tgttgggcgg ggcgctcacg cgttcaacgg ctcgacggtt 
15661 ccggcggctt cgctggggct cgacggcgac acgtacgccc agttcactgt gtcgaccaca 
15721 ctcggcgtca gctcaacgac ggtcactatg tgggcgaagt cgggcggcgt gtggtcgaag 
15781 gtgtcggacg gcgttcgtgg cgcagcgtgg tacacgaaca acacgggcac cccgtcggcc 
15841 gacgtgcccg ttggtgacat gctccttcgg gtcgactccg gcgacgtgta ccaacgcggt 
15901 gcgtccggct gggacttgaa ggggaacatc aagggcgcga agggcgacaa gggcgacacg 
15961 ggcgcgacgg gcgccgactc gacggttccc ggtccccagg gtccggaagg gccggagggt 
16021 ccggagggtc cggccggtcc ggagggtccc cagggtccga agggtgaccc gggtacgggc 
16081 tccgtcaact ctgtgaacgg tgacctaggg ccggatatca cccttgccgc tgccgacgtc 
16141 ggggcgattc ccgttgccga caagggagtt gcgtccggcg ttgccacgct gggcaccgac 
16201 gggcttgttc cgactgacca actcccccag cttgccgacc cgaacgccgt gacgtcggtc 
16261 aacacgaagc cgggtccgac ggtcacgctt accgctgccg acgttggggc gctcgccact 
16321 tcaacgaaga acgctgcgga cggagttgcc ccgctcgact ccgcgaagcg cctgccgatt 
16381 gccaacgttc ccagcgccgt tccgaagaac agttggactc cgcaagcgct gggcttcgaa 
16441 gcgtggtcgg tctacccggg cggcgtggtg aacccggttg cgaagtatct gactccgcag 
16501 cgtctttacg tgacgggctt caacatcacg gagccgacga ccgtaaaccg aatcgtcatc 
16561 ttcgcgcgtg gttggggcgg cgtttcggct gaccgattca tggcgggcat ttacaaggaa 
16621 agcactaccg gcaggg^^c ggtagttgtg aagtcggatt ccgtcgcact gccccaggct
16681 gggcaggaaa ccggcq^^ tgccgctatg cggtcaacgc atgtcg^^^ cgttccgctg 
16741 cccattgcgt cgacc^^pt tcagcctggg cgctactggg tgacgtl^C tcaggtgaat 
16801 gggggcacgg ccgacttcgc gttctaccac gtgcagaacg aagccacgat ttcgacggca 
16861 aacttcttca tgacggattc gccctgggcg cgcgcatggt acgtgtccga taagacggca 
16921 cttcccgaca cgctgaatca ggcagcgtcc gacgtgcttg ccaaccatga cattccgatc 
16981 atggcgcttg ccaacgtctg agtctcaacc tactcacgtg agtaggtagc gccctggggg 
17041 caatcccgcc cccagggctt ttccatgcca tgaaacggag cgcgttttga gtctcgcaaa 
17101 ggttgtgtcg gtcgcgaagg ccgaggtcgg ctatcacgaa ggccggtcgg gcggacattg 
17161 gaacaaccac cagaagtatt cgcctgccgt gccggggctt gagtggtcgc agaatcaggc 
17221 ttggtgcgcg accttcgtga gttgggctgc tcttcaggct ggggagtcgg cgcactaccc 
17281 gcgcacggct tcgtgtgcga cgggcgtgaa ctggttccgg aacaaggggc gttggtcggc 
17341 gtatccggcg gtcggcgctc aggtgttctt cggcaacggg ggcggctcgc acacggagat 
17401 ttgctacgcg tacgacgctg actatgcgta cacggtcggc ggcaacacga acacgaacgg 
17461 cagcgctgag ggcgacggcg tgtatctgcg caagcgtgcg cgtcgggatt cgtacctgta 
17521 cggctacggc tacccagacg ttgcgggcgg ctccgtctcc gccgacccgg acgcttcgaa 
17581 gttcggttac aagcacaagg cgaccggcga cgttggcgac gttggcggct cgaccacgcc 
1.1^^1 gctgagccgt ccggagccgt cgcccggtac aaggtgacca tcaacggtct 

17701 tgagtacggc tacggggcac gtggcgccca ggtcacccag gtcggtaagg cgcttgtcgc 
i-ioo^" ^aagggcttc ggcgacgcgt acaaggacgg tccggggccg aattggtccg acgccgacac 
i^fo, gaccaactat gccgcgtttc agcggtcgtt gggctactcc ggcaaggacg ctgacggcgt 
17881 gccgggcgaa gggtcgctga cgaagcttct cggcaagctt cccgcgaagg cgaagccgaa 
r^ll^ ggcgtcgtat gagccgttcc cgggtgccgc atggttcaag aagaacccga aga.gcccaaf 
'"3^*''^'='3'3<=c at-gggcaagc gccttgtcgc tgaagggtgc tccgcgtacc ggtccggtcc 
18061 gggtgcccag tggacgaacg ccgacaaggc ttcttacgcg aagtggcagc gcaagcgcgg 
.III ^^^''^'='"3<3C gccgacgctg acggttggcc cggtaagacc acgtgggacg cgctgaaggt 
,oo!i" *=^^9^aggtc tgagcgtcga tcactctctg ttcgttccgc ggaagccggt atgtccgatt 
llll. ctgtacgggt tcgggcatgc cggctccggc tcaaatctca ctctctgtaa gggggcgcca 
18301 tgggcgacca cagcaagccg ggcggcaact acgcacgcgc cgcgttgagt tgggcgaagg 
18361 ctcacccgaa ggtcgtcacg tccgtcgtcg tcgcccttgt cggcgtcgct acggccgtga 
llAl agccggactt cccgggtaac gccgttctgt ctctcgttca cgccatcctg ggggcatagc 
18481 gccgctcagg ttgccgactc ccttcacggc atgcgtaagc acgtcgagcg aagggaacgg 
.l^tl cagtggccta ttacaagtcg atcggtctta tcgggcgcgc tcagtcgggc aaggattccg 
18601 tcggggcgcg tctccggcag cgttacggat atcagcgcgt cgcattcgct gacccgttga 
ifl^f^ f^^^^^""^^^ gctccgaatc gacccgctca ttccgacgtc gtacggcgta caaacccggc 
*^^9^9acact cgtgaatgcg gtcggctggg attacgcgaa ggtgacctat ccggaagtgc 
,oI!, gccgaatcct tcagcacgtc gggcagaccg ttcgcgatat cgaccccggc ttttgggtgc 
.111. ^^^ff^'^gtt ccccgctatc gacgctgcgg agcgcctgag cctgcccgtc gtcgtcacgg 
.11^. acgttcggta tgagaacgaa gcgcgcgctc ttcgtgaccg gggcttcagc atggttcggg 
iQofJ *^g«ctcgacc cggcacgctg aagccggacg cgcacaagtc tgagacggag ctagacaact 
,o«o, gggcgaccgc gctgactatc agcaacacgg gaacgctcga agacctcaac cggatcgttg 
Jq?5! actcactcct gttgccgcgc tcgcgctagc accttcggcc cccgccgact gaccttcacg 
9f gcgggggctt tcttgtgttc cgaaggtaaa gctttggtaa cgcacccagc 

llll] ctactcacgt gagtagcttg gagcgtgggc tagggtgagc gaagcaccac aacgacaacg 
io,oi ^^^^^gcggg aacatgaagc gggtcactct cggcggcggc aaggcggttc actactcgac 
^938? ggcttcatgg cttccccggc gtgcggcggc aaccgggcat cggagcgcta 

iq!!^ *=9tcccgacg gacgccgacg tgacgtgcaa gcggtgcgcg aagatccttg ccgctgaggc 
gg^gcgcgaa gagcgcctga accgtgaccc gcgcggtgac gaatggatgg ggcgcacgat 
19501 cggtgacgcc gtgaccgtga cccttcacgg ccggacgttc gacacggagc tgaccggcgc 
19561 cgaccacatc acgcccgggt ggacggtggc ctacgtggac gaagacgggc agcgcaacgg 
lltl, m^ll''^ gtggtgaccg acgccgacat tcaggacggc gacaaggtca gcgacccgcg 
llll, no™^^*""" ttcgacaagg ctcgcgcgct gggcatggac tgggccgaag cgctcgacta 
illnl ^g<="acgcg aagactgccg aaatggctca gcctactcac gtgagtagtg tagagtcggc 
lllVi nlttTJ^tt ^^^^^^5^^^ acaaggggac gggcaccatg gccacgaaga agctgaagct 
llltl Ittft^^f cggggcgacg tgcgcattgg cgccgtgccg ggggccgacg cgattcacgc 
^ctccggaac gccgttgatg agaacgggcg caaccttccg atgtgccgca cccgcacgL 
InnA ^^^^^^'^^att cagtattggg gaccggccgc tgagcagaag ccggaacttg agctgtgcgc 
20041 cgggtgctcg aaggtcgtgc cgaccggtga ggtttccgtc tccgaagagt ccgttgaggt 
20101 tccgggcctg agtatgacgg tcagtcagaa gagctacacg cccgttgagg gcgacgacaa 
loitl llttt^f.t.l'' ^f^^cagcga agaacgacac ccaggacgtt gacgctcaga tcagcgccgt 
5o?rJ ^^^'^^ggcac gtcgacaaca tcaagacggc cgagacggtt gaggccgtga aggaagccgc 
2034^ alltlt'"' ^^^59^^^^- tcacgaccct gccgacgaag caccgcLca cg?tt2gc?c 
2^40? gaagcccgca cggcgcgaga gacggagctg acgccggtca ccccggaagc 

20401 cgaagcggcg aaggccgaag tcgagtcgcg ccggtcggcc gacgtcgccg aagacttcaa 
20461

20521 

20581 

20641 

20701 

20761 

20821 

20881 

20941 

21001 

21061 

21121 

21181 

21241 

21301 

21361 

21421 

21481 

21541 

21601 

21661 

21721 

21781 

21841 

21901 

21961 

22021 

22081 

22141 

22201 

22261 

22321 

22381 

22441 

22501 

22561 

22621 

22681 

22741 

22801 

22861 

22921 

22981 

23041 

23101 

23161 

23221 

23281 

23341 

23401 

23461 

23521 

23581 

23641 

23701 

23761 

23821 

23881 

23941 

24001 

24061 

24121 

24181 

24241 



cgacattgaa '^^cgtgcccg 
cgacctgggg aagctca 
gcgtcagaag gtgaacc 
gacgaagaac gctgccgctg 
cgttgagcgg cagggggcgc 
cgtccttgtt gactgcgtcc 
cgcgtccgaa ctgttcggtg 
agccatttac cggctgtacg 



gcttgcccgg 
gctgacggac 
caaggaactg 
cgagaagagc 
caaggccggt 
ggagctgtac 
ggttacggcc 
gcgcccgggg 
ctctgttcgc 
cggaatgcgg 
cgtgcgctga 
ggtcagcacg 
gggctgagtg 
ggtaatcccg 
acaagagaca 



tacgaccgcc 
ggcgacaagg 
aaggccgaag 
gacgccgaga 
aagcgcttcg 
agcatcattc 
gacgaagacg 
gcttcgcccg 
cgacggcaag 
cacagcggca 
ggctgtacgc 
taccgacggg 
atgaagtgac 
gctcgacgcc 
gacactcacc 



taccggggcg gctcgcgctt 
gtgacgtcgg ttatcggcat 



atggcagcga 
cgcgaaggcg 
gacctgggca 
cgtgtgcgcg 
aaccctgaac 
tcgttcgatg 
tccgggaagc 
gtcgcccttc 
cgggaaccga 
gctttcaagc 
accttcgatt 
gcgtcggggc 
cttactcccc 
ttgagttcct 
ctgccttcgt 
cgctgggctt 
tgacgcccgt 
gtgccttaca 
acgtccttca 



cgcttgccgt 
ccattcagta 
gcgaagcgca 
ctgatctgac 
ttgtgcgcgc 
tggtcatgcg 
ctcacctgat 
agatgagcgc 
tgccggagtt 
cggtcgagac 
gggaccggga 
gcatggtgac 
catgtaggca 
ggcattcgtc 
cgtcatgatc 
cattgactgc 
gacgcgcgac 
cgggcgctga 
gcacaagtga 



gcgaagcgtt cgatctgggc 
gccgacgaca ccgttgggcg 



gtcgttcccg 
gctgtggctc 
ttcatcgacg 
cactgggata 
ttcgtgtcgc 
gatgagcgga 
ttcacgctcg 



tcgcgctcga 
agctcttcgg 
tgttcaccga 
tgaagctttg 
acgccgacga 
aggcagcggc 
ccgacgaccc 



ctcttcaagg ttctgcacga 
gtgctcgcca atctcgaact 
aacaagcttg tgagctacta 
attgccgact agtggcagcg 
gcaccgcttc tgacgacgcc 
gcgctgtgct gagtgagcgt 
cggaatacca ctaatgggaa 
cgtcggcgac ctgatcaatt 
gatcattcgg aagattgaga 
tcagccgacc ggcgttgact 
atggatcacg acagaacacg 



acctgatcaa 
ccaacgcggg 
cggcgaccgg 
aggtctacgc 
acaactccct 
gcgcgttcga 
acaagctcga 
ccgggcaggg 
gggtgaaggc 
acgccaaccc 
tgccggaaga 
agacggccga 
cgaaggtcaa 
gcgctgccgc 
agtgaccgac 
taaggnggta 
acgcacgccg 
ggcttcccgg 
gacgcctgag 
gcttcgtcat 
gctgtgactc 
acttcatcac 
cgctgaaggg 
ctacgtccac 
gctgccgaag 
cgactcaatc 
catctctggg 
cgacctgttt 
gccctacgtc 
cgaagatgtg 
cgtatggctc 
catgggcgac 
ttacatgaac 
cgacggcgct 
tggcccggac 
cggctcgcgg 
cggcactcag 
cccgttcgaa 
ggcgtcgtgt 
ctgattggca 
ctgtacgcgg 
tagggtccgg 

9gg99ttttt 

agcgggcaac 
cggagacgaa 
tttccactcc 
caagtggcgt 
cggcaccccg 
ccggccgaag 
gctgaacggc 
agagatgatc 
gaaggagtac 
ggagcttggc 
agccgaagac 
tgagcttgtg 
caagccgacc 
ctcgacccgg 
gtgaaggcgc 
gctcgacgct 
agcgcggaac 
acgcgacccg 
ttcggacggc 
ccggttacgg 
cacgcctgtt 



ggacggcgtc 

tgagaagttg 

tctgcccgac 

ccaggcgaag 

cgttcgcgcc 

cggtccggac 

cggactgaag 

aatcgagctt 

gattgagggc 

gaaggacgtt 

gatccttacc 

cgcgctgaag 

gacggcgaac 

cgacgcgttc 

cgaacacagt 

ggaccgtgcc 

tactgaagga 

tcgggggctc 

cgcccgtctc 

gccgtcacag 

attcccaaga 

tacgtcaccg 

gggcacaacg 

cctgagaacc 

caaaacttcc 

gacttcgttg 

gcggctcgcc 

gagcgcctga 

gagcacttcc 

gcgtggtcgg 

gacgccgacg 

tggaagacgt 

gccgacttca 

gccgttctgc 

gtcttcgccc 

aaggtgatcg 

cgacgcgccc 

ggagagaagc 

tcgtcgctgc 

tgtggcacgg 

tcggtctgac 

ccggaccaat 

gtgttgcgcg 

aacgcccccg 

gacaacaagc 

ggctacagcg 

atctcgaccg 

gttgagaacg 

tccccgtgga 

aagctgaagc 

ggtcagccgt 

gacgcgccga 

cgcttcaagt 

gacgttgagc 

gagtacacgc 

atcacggttc 

cgtcgccgga 

cactgtggca 

tcggcattcc 

ggtcactgac 

ttgcgggaat 

gtacggaaag 

cctgggtgag 

gcgcagcaat 



aagctcttct 
^^^acgtca 
^^^cggcgg 

aagcggattg 

actcagaaca 

cgcaaagagt 

gacgacgcga 

ccccgctacg 

gccacgaagg 

gaagcgctcg 

gagaagcttg 

gtgattcgcg 

gagaagcgga 

gaccttgacc 

gagtgaagcc 

cggttacatg

ctcagcgacg 

gactccgccg 

cgcccccaac 

cgccgcacag 

ttcctaatga 

ggttgcccac 

tgccgaagat 

gtgaaatcgt 

ttgccccgtg 

ccgacatggc 

ggtacacgaa 

tccggggtga 

gggagttcct 

acacgtacgg 

gcaacccgac 

cgaaggcgac 

tcattgaccc 

acgtgactga 

agttccttca 

gtaagccggt 

ggtagcgggt 

tgtgcgccgt 

cgtcgtcggg 

ccacaacgcc 

gtcgcttctc 

cccacacctg 

ttgtaccgcc 

acactttggg 

cgaagaagcg 

agacgaatga 

gcgaacagtc 

aagagagcac 

tcattgaggc 

accactgcga 

gcgggtgccc 

acccggcaat 

tccagaccgg 

gcgtcggcaa 

cgaagcgtgg 

tgaagtcgta 

agcctgggca 

gtacccgccg 

tacggcagac 

tacgcgggtg 

cgcgcccgtg 

ctagttccct 

cgcaagacgc 

gtgacgggcg 



ctcagggcgt 

tgctgaccat 

agcgcaagac 

ccgacgacga 

aggcttcgga 

cccttgccgt 

gcatttccga 

gccggacgga 

aactggagac 

aagagaagat 

agccgaaggc 

cccaggtcga 

aggcgaaggc 

tgagcgcgct 

cctgggacac 

gtggttgagt 

ccccagacga 

acgtgcaccc 

agccccgtcg 

cggatgacct 

cttctccatt 

tgccttcagc 

ccgcactatc 

ccacccgggc 

gcaagccaag 

agcgcgcgac 

ggttcgcgcc 

gtacgtcggc 

gaaggccgtg 

ctatgccgga 

gccggaccgg 

gtatcccgac 

ggacggcaac 

cgagacgtgg 

cttgcgtcag 

cgcacggaag 

tgccgactcc 

tctgaggtag 

ctcgccctgt 

gccgtccccg 

gcgctcatcg 

agccccctca 

cgggttgccg 

agcactcatg 

tgagacgtac 

gcggggcaag 

cgttgcggac 

gtctgagaac 

tgacggtatc 

cggctttgac 

gaagctcttc 

caccgtgacc 

ttcttggacg 

gggtggcgct 

cccgatgcgg 

caacgacgcg 

cacaccatgc 

gaagcccgtc 

gacttcgatc 

aagccctgta 

cgtccgacgg 

tccttcaggt 

ttcggaagga 

aacagaacgg 
24301 ctgaacagac acacgccccg ggcggagcaa tgcgcttcgt gccggga^t gtcgtcgttc
24361 agcgtactca cgtgag^| ttgagagatg catacgggat tgttca^^ gcccgacgct 

24421 gcgcctgccc tgggc^^^ gcgagcgctg aagccgggcg acacgglW tctgaagccg
24481 ggcgccactg agcggaagga ctgggggcgc tatctcgacg cgctgagcaa cgccataacc 
24541 cggggcgcgt cggttgtgtg gtggggagag tgagtagcga agtgttcaat gcgttcgtcg 
24601 gcggcgtcat ggtcggcata tgtgtcggcg tgcttgccct tgccattacc gcaatggtgt 
24661 gcgcgcaccg tgagcgggtg cgtaatggcg catgaatcga agtgtccttg ccagccgtgc 
24721 cggaacaagc gccggaaggc ttacatcaag gactattacc ggaagcttcc gcgcgacaag 
IaII] ^^ccacaccc tgagtcagaa gcgtcgggcg acggcatacg gggtcgagca cgaagagtat 
24841 tcgcgaacag agatcatgcg gcgctggggc taccggtgcg cctactgcga cgcaaaggcg 
Itl^^ acccatcttg accacgtgca cccgctgagc aaggggggcg ccgacgctgc gcacaacatg 
Itlt^ cttccggcgt gcgcgaagtg caacctgagc aagggcgcga agacgcttgc cgaatgggcg 
25021 ctgaccttcg gtccgaagcc cgccgactga gccgggttgc cgactccctt acgggtccct
25081 gagagcaact acgaagggga gtcagtatgg acttcgcaag catcctgggg cgcttcaagg 
25141 cagtcagcga agagccggac gggggttatc tcgcgctgtg cccggcgcac agtgattccc
25201 gtccgtctct gcgcatatgg cggggtgacg acctgaaggt tcggctgaca tgccgtgccg 
25261 gttgtgacac gggcgacgtc gtcagctccg tcggcctgaa gtggtccgac ttgttcaacg 
oRoo, ^9^^999=9^ aggtctgacc gtcccgaaga gaagccgaag atggtaagtg gcgcgccggt 
tlll^ aacccgactt cgcatgtggc ttgagtcgct gccgctcacc caggatgccg ccgactacgc 
25441 cgccgaccgg ttcgggctcg acgtcgccca ggctgaggcg ctggggcttc gctactcccc 
25501 cgacgggcag ggttatgact ggcccgactt cgtgtcgacg tcgttcgccc ggttcccgcg 
25561 catggtcgtt ccgctgaagg gattcgacgg cgtgacccgt ggcgcccagg ggcgtgacct 
25621 gagcggcaag tgccccggtc gttggctgag cctgaagaac ccggacqqgc aacactgggc 
^^^^^^^Qyc gtgttcc-ggg gtgacgccgg ttacggcgtt gtcctgatca ctgagggtcc 
lilt. P^^^^'^gcg ctcactgccg tatcggtcgg ctatgacgcc gttgccgtcc ggggtgcgtc 
25801 tctggtcaac aaccctgagt tggtcgcgga gctggccgaa ggtctgaagg gctttcaggt 
lilt] '=^l^^^'3tgt ggtgacaacg acacggccgg agtcggcttc acgctgcgcc tgagcgaagg 
llll^ ^cttgccggg cacggtatcg acgcgtacgc gctgaacgtg cccgttccgg gtgacgacct 
llnl] ^^^^^^f^^g cgtgagcgcg acccgggcaa gttccccagc cgactgcacg acgccgtcaa 
26041 gtcggctcga cccgtccgcg accgtgccca ggttgaggcg gagcaccgta aggccgaagt 
it] J: ^^f^caccgt accggcgccg ttcaggtgtc gagcactcag ggcgccgacg ctgcgcgcat 
26161 cctgggcgac cttgtgtcga cgtacggcga gagtgacgcc atgaacgctc acgcgcttgt 
nil] ggcatggact gacggccgga tcaagtacgc gtccggcctg ggctacttcg tgtgggacgg 
Itll] gtgaagtcgg caacgcgcgt gcgtcaggag attcacgcca tgggcgcggc 

Itlt] ?<=ttgtgctc gccggttgcc tgccggagtc gcgcggcttc accatgacga cgcgcattga 
26401 tgcactcatg acggagcttc gcagtgttcc cagcgtgcac gttgaggcgg aagagttcga 
26461 cgcgaacgcg cacctgttga gcttcgcgaa cggcgtggtc gaccttcgta cgggcaagct 
26521 ccgcgcgcac gacaagggcg acatgctcac tgtgtcgctg ccgatcgagt acgacccgaa 
Itll] ^^cccaggct ccgcgttggg aacagttcct tcaggaaatc ttcccgaaca atgctgacct 
It^t] ^^tcggctac atgcgtcggc ttgtcggcta cggcattacc ggcaatacgt ctgagcagtg 
26701 tttcgcggtg ctgtggggca agggcgccaa cggaaagtca gtgttcacgg agacgctgac 
26761 ggacgtgttc gggcgcatca cgaagacgac gcccttcgcc acgtttgagg acaagggcaa 
26821 cggcgggggc attcccaacg accttgccgc gcttcgtggt tcgcgcctig tcatgj?gtc 
26881 cgaaggcgag tcgggcaagc ccatgtcgga agctgtgctg aagcgcgtga cgggtaagga 
270^1 mitlt^"'^ gcgcgattcc tgcggcagga attctttacg ttcgcgccga cg??cctg!t 
l^ni] accaaccaca agccgaagtt caagtctcag gatgaggggc tttggcgtcg 

27;?^ Talt"J.T. ^"'^"^'^'^^^ tgcgctactt cgcgccggaa gagcgcgact acgaL?tga 
27^8^ arllttrn^n ^^^^^^^^^t cggcgggcat tgtggcctgg gcggtgcgcg gtgcggtcga 
27181 gtggtatgcg aatggcctgg gcgacccgga gtcgattagc accgcaacgc gtgagtaccg 
27241 ggcgacgtcc gatgcgctcg ccggtttctt cccgggcgtg ctcgacgctg Lga^gact? 
llll] ^^^5^"f 9 tcgggcgctg acgcgtacaa ctcttaccgc gattggtgtg aggctgaggg 
27421 f 5^^^9tcc actgaggttt ggtcccggaa ggcgttttac ggcgccatgg aagag?gggg 
2748? n^^t^'^^^^^ aagaagacca acacgggtat tgcgcttgtc ggcgtgaagt tcgc?ga?gl 
lltl] Tr.t^t"'^"''' gctaccggtc ccggcatctt cggcaaggac tgacacctag gcgcc?tcfa 
nil] =^^^^^^=^^9 tgagtagctt gggggcgctt cggcattccc ccgggttgcc gactccctta 
27661 lirT.''^'' r"^^^" gttgccacgt gatcgagtat ca??acgacg ??cgcggcga 
277^? alnll gtcttcatcc cggagactga gcgcgacctt cgtgagttcl tgcac?gggc 

27781 coaata™ ^^<=9^^<="g cgttggacac ggagacgacc gggctcgcca tgtactcaag 
2784 t tntnlt ^ ctgcgcacgg ttcagttcgg cacggcgcat gaagcctggg tcattcacta 
llloi llttll^^'^'' ggacgcttcg ccgaagccgc cgactacgta ctgaagcact gcccccggtt 
Zllei aaltrtt.t'' ^^^^ccccgt tcgactggct tgtgttggac gcgcatacgc ccgtgtJLt 
Hot] ^5^^^=9Cta gcaccgcgca cggtcgacac gaagattaag gcgacgctga ttgacccgcg 
28^8? ccJat^-^^" 5^^33=99ca ttgggaccgg cctcaagccg ctcag?gcgt tcLcgtIg! 
28081 cccgtcggcg ccggacactc agggcgacct aacggcggtc ttccggtcgc tggggctcac 
28141

28201 

28261 

28321 

28381 

28441 

28501 

28561 

28621 

28681 

28741 

28801 

28861 

28921 

28981 

29041 

29101 

29161 

29221 

29281 

29341 

29401 

29461 

29521 

29581 

29641 

29701 

29761 

29821 

29881 

29941 

30001 

30061 

30121 

30181 

30241 

30301 

30361 

30421 

30481 

30541 

30601 

30661 

30721 

30781 

30841 

30901 

30961 

31021 

31081 

31141 

31201 

31261 

31321 

31381 

31441 

31501 

31561 

31621 

31681 

31741 

31801 

31861 

31921 



gaaagaaacc 

gctcgacgtg 

gagcatccgg 

gcaacgcgcc 

cgaggaagaa 

cggcgcccag 

cggcggagca 

ctgggaacgt 

gcgcgcgggc 

cgggcgtatt 

cggcgacttc 

cggagacgcc 

gttggcagcg 

cttcgatatt 

ccggaaggtg 

tgcccgacag 

ggtcttccct 

gcttgtcact 

cgtcgtgaac 

catgcgtgac 

cagcgccccg 

ggacttgttc 

gctgtatggc 

gtcacgcgcg 

ccccacctaa 

cccgtacatc 

ctccgccggg 

ctgttacgtt 

cacccggaag 

cagtgacccc 

ccaagggaga 

ccttgccgac 

cgcccttgcc 

ctgggcggac 

cacggacacg 

gaaggacgct 

ggtctacgcg 

acagatcatc 

cgcgtggcag 

cggttccctt 

gaaggtcggt 

tccccgtgac 

cgagacgact 

gactgagcgc 

gaccgaaggg 

gcgcgagaag 

cattctcaag 

cgacatggaa 

cccgaaggcg 

tgaggcgctt 

caagtaacag 

tccttactgc 

gaccggccac 

cgccgacggc 

caagcggctt 

aagagcgccg 

cccggagcac 

aggacacggc 

tgaggacgaa 

gcaccccagc 

attcgattcg 

gtccggcttc 

cggtcaagtg 

cgaacaactg 



g^tgggcgg 
l^^^atacgg 
l^^^cgctac 

gggctcgcgc 

gagaagcacc 

gttgccgaag 

ctgaaggtcg 

atcggggcgc 

aagtgggtaa 

cacccgaaca 

gcggctcaaa 

ccggatcaca 

ctcgccgacg 

cacatgtaca 

ttcaagggcg 

accggagcga 

gagatcaaac 

gtgtccgtca 

tatcagtgtc 

gctgggcttc 

aaggcggacg 

ggcgtgcccg 

gccgacgtgt 

aataacgatg 

gccccgttac 

tgaaggatga 

tctctgtggg 

cgttgagcac 

ggtcgccccc 

ggtcactccc 

aacccagtgc 

cggcttgccg 

cgtaaggcgg 

gaattcaccc 

accgttgacg 

gtgcgggaag 

tccatgctcg 

ccgccgaagg 

ggtgccgtgt 

gccgacacgc 

cggggcgcac 

gccgaagccc 

ccggcggacg 

cggtacgtgc 

gaagttgccg 

cacgcacgcg 

cactccttcg 

ggcatgtgcg 

cgtaaggcgt 

gccgctgtgc 

accacccggg 

caaacacaga 

accgtcacca 

gacgtgatca 

gcctgccgca 

caacccggga 

tcccgaacaa 

acggcgcttg 

gacgccgtta 

cctttcgccg 

cgcgacgcgc 

cgggtttggg 

gtcttcagtt 

tcgctcttca 



gtatcgacct 
cgcgcctgaa 
ttgagtacga 
ttgatcttga 
tttacgccgc 
cgttgcttgc 
acaaggccgt 
gtgagcctaa 
cgacgtacgc 
tcaacacgct 
cgctgccgtc 
tcatgggcag 
tgaagcgcat 
cggctcagct 
ccggattcgg 
ccgaagcgga 

gggcgtcgtc 

cgggtcggcg 

agtcggcagc 

tcgactacat 

caaaggacat 

tcgtcgccga 

gaccctccgt 

ctcaacctac 

cgatgaacct 

cgtcgggtta 

tacatcgaag 

gacggaccgg 

gctttgcgca 

tgcccgggtt 

tgaccattga 

ctgagcgcga 

ctcagcgcat 

aggttggccg 

cgttcgagcg 

agcgcaacgg 

aagccgccga 

gcaagcgact 

cgctcgacaa 

tgaagcacta 

tcatcgaagc 

gtacgtgcgt 

ttgcggcgct 

tcgacgccct 

acgaccttcg 

ttaacgactg 

gcatcggtgg 

cccaggtcgg 

tcgctaagcg 

tcgaagccgc 

gcggagctga 

caacgaaacg 

ctcagcgcgt 

gcaccgttca 

cccgataggg 

cgcgaaagcg 

cacaacgacc 

actccggctt 

agcgcgtgtg 

acgacgccct 

tgaacaaatg 

aatacgacgt 

cgttcgaagc 

agtgaccgac 



tcggcaccct 

cccgtgcctt 

acatgagatt 

gtacgtcgac 

tgccatgtgg 

catgggcgag 

gttgctgccg 

cccgttggcc 

cgaccggttc 

tcaggcacgc 

gtcggattgg 

cgtcgacttt 

gaaggacgga 

catcaagggg 

caaggtctac 

gattgcccgt 

gcgttggcag 

cctgccgctc 

gcgtgacgtc 

gaagttgccc 

tgcgcgtgag 

cgctgaccta 

ggatttgccg 

tcacgtgagt 

tcggtaacgg 

cgccttcgaa 

gaaaacatgc 

aactgatcac 

tcactgtgca 

gccgacttcc 

gacgattcgc 

agtgatcgaa 

ggctccgcat 

tgtggccgtg 

ctacgtgtac 

caacgccggg 

gggcgacgtg 

cagcaaggaa 

ggtcacgtcg 

cgacgaagag 

cgcctacgtg 

gctcgatgcg 

tgaagacgtg 

tgccgtcctt 

tgacgttcgt 

cattgagtcc 

cgtgaccgac 

cgtgacgtat 

ctatgtcgct 

tgccgctgag 

ccacgcacgt 

gagccccaca 

cggcgccaac 

gcattccttc 

cttcggccca 

tgcacgcgga 

gaaggggcac 

cccgtccggc 

gggcatggcg 

ttatgacatt 

gttcgacgga 

tcccgactgg 

cgttgaggtc 

ggaattcctg 



acqtacaacc 
I^Mzrcgaac 
l^^^tatgt 

acccttcgcc 

ggtgtcgact 

acgctgaccc 

cttgccgacc 

gaagccgtcc 

gcgaacaacc 

acggggcgca 

atgattcgcc 

caggcaatcg 

ttcgttaacg 

cttgaggcga 

gggggcggag 

gcggtcgccg 

cgtgaagcgc 

gaccggaacc 

ctggggcaag 

attcacgatg 

ttcgagaagt 

gggggccggt 

aacctaacgt 

aggttgaaga 

ggcttctgtc 

tcctcagcaa 

ctactcacgt 

cccccgtcgg 

aagcgctgcc 

ttacccgaag 

gccgcccagt 

gcgaccgact 

ggtggcgcac 

tgggactgcc 

gcgaccgttg 

gccgacgaaa 

tacgaagccg 

cgcgccgaag 

gctgagaacg 

cccgacggcg 

cttgagcggt 

cttgagcttg 

ctgacggtcc 

cacgccgccg 

gacgaccgca 

atgggcgcca 

tacgggcacg 

gtacagctca 

gccgtgaagc 

cgactgacga 

tccgccccgg 

gtgcagacct 

gtcgagttcg 

gctgagtccg 

tggctccggt 

agccgccggt 

gacgtgaagg 

ccgtacacgt 

agtgagcaca 

gcgtcgtttg 

tggactgacg 

gcggctcgcg 

gcgtcgtacg 

gggcgcgttc 



tttacgccgg 

acgcgcgcct 

gcgcgtacat 

gcatgctgcg 

ccgtcaactc 

agcggacgga 

ttgaccggga 

ttcgagcgaa 

acgatccgca 

tgtccatcaa 

gcgcgattgt 

aaatgcgcgt 

gcggctccga 

cgaagcgcga 

tcgccacgat 

agtatgaccg 

gcggcacggg 

gtacgtatgc 

ccatgctcaa 

agatcgtgtt 

gcatgaccat 

cctgggggtc 

tccgtcagtg 

tcatctgaca 

atatgccatg 

cgttcccagg 

gagtagtcac 

caggagtcgg 

gtcgcacggt 

tcgagacaac 

ccgccgacga 

cacgcgttgt 

gctttgccga 

tgaagcgctt 

acggcacact 

acgccgtgaa 

cccggctcgc 

ccgcccggct 

cggacgccga 

agattcgccc 

acgtgtccgt 

ccacccaggg 

ccagcgaccc 

tgtcgacgtc 

tggccgattc 

ctcagcgcga 

gtgacgggtg 

agtcgtaccg 

tgaccggtgc 

acgccgggcg 

gttgccgact 

tcacccttcc 

tcacggccaa 

tgcccctgat 

agctcagcgg 

tcgaatccgg 

tgtggcgcgt 

gcgaaggcgt 

gcaactcgac 

agcggtgcgg 

cactcgacgc 

tcggcaagtt 

gattcgagcc 

atggtgcgcg 
31981 ctcttcgttg cacaagSC&t gaaacacatc acggagggaa acggaa tggq agtgaccaca 
32041 gagacgaagg cggag^^a ggctgagcgg gggcgcgtcg tcggct^^ gaagtcgtcg 
32101 gagccgggca agatc^lB tgccgttccg gccgaccgtg ccgtgal^i cccggctcag 
32161 gcgcgtcagc ttgccgcatg gctgaacgag gaagccgacc ggggcggacg ggtatcgacg 
32221 gacgcgcgca ccgctgaggg atggcgcacg gctgaggctg agcggcagcg cgccattcgc 
32281 gaagcgctca acgtcgaccg ggtcacgacg cccagcggac ggacgttcac gcgccgtgcc 
32341 tactgagccg aaggcgccac gcaagacggc gtcacagaag cgcgctgaag cgtccgccga 
32401 acggctcgac gcatggaagg gcgcacacaa cctactgaag ccgcacgact gggctgaggg 
32461 gcttacgcct tacgacgttc tgtcgctcgc ccagtggctt cacggtgccg actcgaccga 
32521 ctgacgacct ggggcgctca acctagtcac gtgagtacgc tgagcgcccc gtttggagag 
32581 acacccttga aggtcaatgt tcttgccacg acggcgctga acccttcgcc cttgctcgac 
32641 gcgtacgagt atcgcgtgtc cggcgctgcc tacaaccggg accggccgac tgacgctgac 
32701 gccctggggg aagccgccgg gcgtatctgt tacaagtcct tcgagcgcaa gaatccggcg 
32761 acggcttcga acccgggtta cctgggcaac atcctcgctc aggggcactt cagcgtgctc 
32821 gaacatgcgt cggtcacctt ccttgtgcgc gacgtctcgc gtgcgcttct gacggagctg 
32881 agccgacacc gacacctgag cttcagcgtt gtgtctcagc gctacgtcga ccacgccgac 
32941 actgagccgg tcgtgccccc tgccattcgc ggcacggagc ttgagaagcc gttcagggag 
33001 gattacgccg aagcgcttca ggcgtacgac gctggggtga agcttcttcg agcccggggc 
33061 tacggccgga agcaagcccg cgaagccgcc cgtgcgctac tgcccaacgc cgcgccggtc 
33121 taaccgggaa ccttcgggcg tggcgtgacg tcctgggcaa gcgttggcac 
33181 gttgccgccg acgctgagat tcgggagttc gccggtcggg tgctcgacca ccttcacgcc 
33241 gtagcgccca actccgttca ggacatgccg acttcgccgt tcgggagtga tggcaagtga 
33301 gctgtacgga gtgcaagcgc gcgaccggtc acaagctcga ttgtgggcag cgcgaaccga 
33361 acccgttcct tcagctcgac gctg-cggcaa tcgacgtcat tgacgacatg gttgacgaat 
33421 ggcttgaccg ggaccggcac ggcgaactga acggcgggta cggctacggg ccgaagaaag 
33481 accgcgcgct gagccgcata cacgaacaga tcaaggaagc atggcgcgcg aaggtgcggt 
33541 atgcggaccc ggaggaaacc gagtgaagcg cgttgccgct gccattgcgg gtgtcgccct 
33601 tgtcggcgce gtgaccgtcg ggtgtgaccc cgggccggaa tgcatcgagt cccacagcga 
33661 aatgacctgg gtgcccatgt acaacggcaa gacgacgacg cttcagccgg tatggacaac 
33721 cgtatgcacg aagtacgaga cggagacgcc gaagtgatca gtgccgtaat catcgatctg 
33781 ccgtcgtcgg gtcgctcgac cgtgaaggcg ttccgcgaag cgtgctgggc gctcgacgtc 
33841 gaacctgagt ttgtcgacgt gacgtcgctc gactgtcggg ccgacggcgt tacccgggtg 
33901 ^^^accgtgc gcgtgtacgc cgacgacgac ccttacggcg acgtgttggc ggagcatcgg 
33961 gggaaggcga ccggcgaaga gatcacggcg cttctgaacc ggggtcttgc ccttgtctga 
34021 ccgaccgtcc tgggacgcgt actttctcgc gggtgctgcc tgggtggcga ctcgcgcgga 
34081 <=tgtacgcgt tcccaggttg gcgccgtcct tgtgaacgcc aaccacgaag ttcggggcac 
34141 gggttacaac ggtgcccctt ccggcgtgcc cgggtgtgcg tccgccggag cgtgcccacg 
34201 tgggcaactg agcgccgtcg agtgcgcgcc caactccgac tatgcgaact gtgttgccga 
34261 ccacgctgag cgcaacgcga tacgtcatgc gccgtcggcg gagcttgccg gagccacgct 
34321 gtacacgacg cgcgaaccct gcccggcatg ttggacgttg atacgggcag cgggtatccg 
34381 gcgggttgtg acgccgacca cttcgcacac tttctgagcc ccgggggtgc ctactcacgt 
34441 gagtaggcag gtaggctgaa gctccgccca gcaacgaagg agacacggac atgatcgcga 
34501 ^cttttggga agacgcttcg gccgtgattc gccgcgcccc gcttgaggac gaaagcgccc 
34561 "^^^^f ^gct cgccggtccg accctgccgg acggctccgc tggggaaatg gtgagtatga 
34621 cgcttgagac ggcgcgcaag ctccgcgacc gactgaacgc ccagcttgcc gggttcgagc 
34681 cgacgccgac tgagccccgt tgcacccggc acggttcgga gtgcgaccgg gacccgaaga 
34741 ^^fcccacat cttcaagcgc taagcctggg aagcccctac ctactcacgt gagtaggtag 
34801 fSgctttttc gtgttcaacc tactcacgtg agtaggctac tgtgaagatc gtttcacgcg 
34861 ttcacaagct gcgaccgcaa cgaacaaaac ggctgggatg actcgacgcg ccggttgtac 
34921 '''''^f^f^^^a tgagacgccg atcacccgct aggagaatcg tttgaagcct caacatggcc 
34981 g^atatggat ctacccaatc cgcttcgccc tggggggtgt cgagtaatgc cgcactcagt 
35041 tTT'^T ^^^^^^"^'^ ctgtctgggt cccgtcaggt gctgcccgig cgtggctt^c 
35101 ttgctccgtc ctgggcgggg cactgaccga cgaactgatt cagtcatgcg acgaactgaa 
35161 agccgtgttc aaggcacacg gcaagttggt tgcccggctt ctgagttcgc cgtcagcgcc 
35221 ccggtacgac gggttcagga tcatcgggcg ccggaaggac acgggcgcca tggttgccgc 
35281 tgtggagtgg gtacggagtc gcgagacgcg tgagcttgtg cgcggctccg tcatctggac 
35341 ttt^^'^l^^'^ tacgtccacc cagcgacggc gcttgttgcg tgatcgttgc ggaatattcg 
35401 attccgaacg cgtgcaccat gcaagcattg aagccccaaa ccgctcatac aagatcatcg 
35461 gtgacacatg cccaacaaga tcactttcgg cgcgtccgta ctggcgtccg cagcaacggc 
35521 gttcggcctg ggagcgctcg ccttcacgtc ccccaacgcg ccggagacgg catacccgct 
35581 gccgactcac tcagcgccgg atgtcgagcc ggagacggcg cccctgagcg tcggcgcgac 
35641 cgaagacccc gggaccgtgc ccacaatcgc cccagtggct tgtacagcgc ccagctcgac 
35701 cccgaagacc ggtcggcact cgaagccccg tacggagccg gagacggacg acacggccac 
35761 gctgccgcgc catgcgaagc ccttggcaac gccgtcgccc agctcgacca ctccgggaag 
35821 ggcagcatga^^ctggaat cgatatgccg acgtcgggca «aaggtc ggcgcccatg 
35881 tcgtcaagaa^J^tgccgtc gtcttcggcg tcagtgagcg cttcga accggcgggg 
35941 cacaaccacg^^Bccgacgc tgagacgccc ttgatcgctg cggctc gcctgagtac 
36001 ccgtatgaag aagcctgggg cgacaccccg gacactgccg ccgacccggg cgttgtgtgc 
36061 gacaaggaac tagcggaagc cctgggcgtg ccctactcgc cgacgtgcta cacgggccac 
36121 tcacccgccg acacctgggc gcgcgaggaa gccgaagccg acgaccaacc cgccggtccc 
36181 gtttgggact tgagcggcgg caacggttgg ttctagtccc gggttgccga ctcccttact 
36241 gaccgtcaga agagcaaaaa tcctcaactg acgtaagggg caagggagtt gggcacgaag 
36301 gcgattgtgc gaacccggcg ggtgctgacc ggggggcgtt ggttcctgat cctggggctt 
36361 gtgttctaca gcctcatgac gacgacgccc ttcgtcagcg cgcacagcga atgggcatgg 
36421 tccggctggg tgctgggctt gatcgttgat gcggcgttca tcatggcgct gagcgctgaa 
36481 gggacgttgg cgaagtacgg cgtgacgaag ctgggcgcgt ggcccgtcgc gttccgatgg 
36541 atcaccggtc tgtcgtcggt cttcctgaac gtatggctga gcgtgtcggc tcatgactgg 
36601 gtcggcgtgg cagtgcactt gatcgctccg gcgcttgtga tgcttctcgc cgaagtcggg 
36661 cccgtgtaca tgaaggcgct tgccgacgct gagcgggaag ccctgagcgc ccctgagccg 
36721 gtcgcggaac ctgagccggt accggcggag cctgagcccg tcacggagcc ggagacggcg 
36781 ccccaggccg ttcaggacgt ccttccggag ccgacgccgg agcctgagcc ggagcctgcc 
36841 gacgacgacg ttccggcgcg tctgccgaac gcccaggcaa acaagatcat tgaggaaggg 
36901 tggcgtcacc ggctgaaccc ggttgaggtc gctgccgctg ccggtcggca ccctgccacc 
36961 gtgcggaaga agttcgccca gctcgacgcc gaactgagcg tgtgagacgt gcccccgact 
37021 gactgccttc gggttggtcg gtagggggca tttttcgtta cccagggtca ttccgcgggg 
37081 ccggttatgt ccgttttaac ccagctcaaa aatgccggct ccggcccaaa tttttactca 
37141 gcgtgaccgt cttaccctgg gcgccatggg accgaagacg caagcggcgt acgtgatcat 
37201 ttccgggtgg attgcgaccg gccgatacgg accgggcgac aagcttcctt ccgaacgcgc 
37261 catgtgcgaa gacctaggca tcgggcggac ggcgctccgt caggtgctcg cgaagctagt 
37321 agccgaaggc atccttgaag tccatcaacg gagtgcctac cgggtacctt caggcatgcg 
37381 aatttcctgg gtgatcgaag atcacggagc caacgaagca acggcgccga ccatcgatag 
37441 cgccgctgag gcactgaccg ccggagtgcg ctccgcctac gccgacgaag acacgacgac 
37501 gctcgcgcac attctcttca acgtggtcgg cccgttgcgc atgaagcttg tgaccgatgg 
37561 ccggtttgaa gtcgagcggg ggcgcacctg ggaagcccgc gaaggtggaa ttttcgtcac 
37621 gctctccccc aactgaggga agtcccctag cgtgttccct atgcgtcggt atcgagtcac 
37681 acagcgtccg cctgagtggc ggcaggaact acagcgaacc gaagacggca acccgcccgt 
37741 cgtgcgcccc tgggtgatct tcgacggagc catgaagggc tattgcgcgc tgcccgacga 
37801 cggcgacccc tcaaccctca tgccgctcga atgggtcacc cagcatggcg cccaggcgtg 
37861 gctaatgcgg tgctaccgga cgtggggcga agtgcccctt gtcggcgggg gcgccgtccc 
37921 gtacaacgtc gcgcgtgagc gggtcggccg gtgaagagat acagggtcat ggaacgcgcg 
37981 ctacagcgcc ggaaaaacgg aacctgggtc gtgcgcaaga atccgccgtt cgtgatcttc 
38041 gatgaagtca tgggcgatta ctgcgcgctg ccccaggacg acgacggcga gccggtgacg 
38101 ctcgaatggc gttcgcggtc ggcggcgtat gactggctcg cccactgcct tcagacgtgg 
38161 cagatgtggg agcgcacggg gcgagccgct gacgtcccga aggcgtggcg cggcttcccc 
38221 gtgccggagc aatcgccctg ggtgggttac acgacgcccc tctatggccc gtactgacgg 
38281 acacaccgaa gccccggcgg caaccctcag cggatgcccc ggggcttcac gttttcccag 
38341 gtcagaagcg gttttcggga gtagtgcccc aactggggta acctttgagt tctctcagtt 
38401 gggggcgtag ggtcgccgac atgacacaag gggttgtgac cggggtggac acgtacgcgg 
38461 gtgcttacga ccgtcagtcg cgcgagcgcg agaattcgag cgcagcaagc ccagcgacac 
38521 agcgtagcgc caacgaagac aaggcggccg accttcagcg cgaagtcgag cgcgacgggg 
38581 gccggttcag gttcgtcggg catttcagcg aagcgccggg cacgtcggcg ttcgggacgg 
38641 cggagcgccc ggagttcgaa cgcatcctga acgaatgccg cgccgggcgg ctcaacatga 
38701 tcattgtcta tgacgtgtcg cgcttctcgc gcctgaaggt catggacgcg attccgattg 
38761 tctcggaatt gctcgccctg ggcgtgacga ttgtttccac tcaggaaggc gtcttccggc 
38821 agggaaacgt catggacctg attcacctga ttatgcggct cgacgcgtcg cacaaagaat 
38881 cttcgctgaa gtcggcgaag attctcgaca cgaagaacct tcagcgcgaa ttgggcgggt 
38941 acgtcggcgg gaaggcgcct tacggcttcg agcttgtttc ggagacgaag gagatcacgc 
39001 gcaacggccg aatggtcaat gtcgtcatca acaagcttgc gcactcgacc actcccctta 
39061 ccggaccctt cgagttcgag cccgacgtaa tccggtggtg gtggcgtgag atcaagacgc 
39121 acaaacacct tcccttcaag ccgggcagtc aagccgccat tcacccgggc agcatcacgg 
39181 ggctttgtaa gcgcatggac gctgacgccg tgccgacccg gggcgagacg attgggaaga 
39241 agaccgcttc aagcgcctgg gacccggcaa ccgttatgcg aatccttcgg gacccgcgta 
39301 ttgcgggctt cgccgctgag gtgatctaca agaagaagcc ggacggcacg ccgaccacga 
39361 agattgaggg ttaccgcatt cagcgcgacc cgatcacgct ccggccggtc gagcttgatt 
39421 gcggaccgat catcgagccc gctgagtggt atgagcttca ggcgtggttg gacggcaggg 
39481 ggcgcggcaa ggggctttcc <^99gggcaag ccattctgtc cgccatggac aagctgtact 
39541 gcgagtgtgg cgccgtcatg acttcgaagc gcggggaaga atcgatcaag gactcttacc 
39601 gctgccgtcg ccggaaggtg gtcgacccgt ccgcacctgg gcagcacgaa ggcacgtgca 
39661 ^''^^''^^^^^ ggcgg^tc gacaagttcg ttgcggaacg catctta-^c aagatcaggc 
39721 cgacg^^ acgttggcgc ttctgtggga agccgJ||i cgcttcggca 
39781 t^tl^^t^^ ggcgcj^ aagagcggcg aacgggcgaa ccttgSPb gagcgcgccg 
39841 «^g^=ctgaa cgcccttgaa gagctgtacg aagaccgcgc ggcaggcgcg tacgac^gac 
39901 ccgttggcag gaagcacttc cggaagcaac aggcagcgct gacgctccgg cagcaagggg 
39961 ^-^^aagagcg gcttgccgaa cttgaagccg ccgaagcccc gaagcttccc cttgaccaa? 
40021 ^f^^*'"'*'*'^^ agacgccgac gctgacccga ccggccctaa gtcgtggtgg gggcgcgcgt 
40081 cagtagacga caagcgcgtg ttcgtcgggc tcttcgtaga caagatcgtt gtcalgLgt 
40141 *=9^^^^^9gg cagggggcag ggaacgccca tcgagaagcg cgcttcgatc acgtgggcga 
40201 agccgccgac cgacgacgac gaagacgacg cccaggacgg cacggaagac gtagl|gcgt 
40261 agcgagacac ccgggaagcc tgttaggcgc tgagacgggc gcacagcggg cttcc?gggg 
40321 ggtcggccgg tcccccggtc ggcccatttc tcttgtc??g gtttag??ag 

40381 llttltn^t'^ taacagtgac tccgtcacca cagcacagcg gggcgagccg ttgacctggg 
40441 ^^aagtgatg ctgtgacgga atgactcgaa acacacattc ctaatgactt ctcattgggt 
40501 ItltT^tt'' ^"^5-="^^ tcatcacagc gtcacccggg cgccc?tcgc tgtgac???^ 
40561 ^^Jr^^ ^ ccgacaacct tcatataggt agaggggttt acgcgccacg catcaagcac 
40621 nnr.lt '^^^'^ cggcgtcgag cgctacccac tcaggccggt cactcccct| atctctLca 
40681 ^^^"^^r^ accggcgttg cctccctagc tcagttcggt tagagcgcct gtttcgtaat 
40741 Ittnf'r g^ggttcgaa tccgtcgggg ggctcaatga gcggatacac aatcgcttgg 
40801 lalllllllt J^^'^^^^^" cggcgtcatt gagggtcgag cgctcttcaa taagaagccg 
40861 ggcgacacgc tgagcgaaca cgtctggtca tggttcgcca cgcaaagcgg cagtacaggc 
40921 Talalllttt f '^^^^^^^ tgctcgacgc tttgcgctac tggcct?ca? gggttggl?c 
40981 actgcccact tcatgacggg cggtcgcttc tagcgctgcc g.cg.cccagcc tactcacgtg 
41041 llTcllTatl P^^^^'^^^e ^^^atagggg ggtgtccccg gaaggggggg tgcccta?g? 
41101 llaltJtl^ tctcgactgt agggactggg ctactcatgg tgggcgctgt gctcagcalc 
41161 lllntnnn ^^^^^^'^f^^^ cgcagtgtga agagccatgc gaagcggcgt gctgctatcg 
41221 tatlln^f caacgctgcg gcgaagatgc gtcgtgctat ccgtaaggca gtgggcgcg? 
41281 cac?^a^^^^ '^^^^^^^^^'^ tggtacctgc catcacagct cgacgtcgac calaLIagc 
41341 Jtlnr ^^gcggcgaa gacgttgaag gcaacgtgca agcgctgtgc aagcgatgcc 
41401 ataagacgaa gacggcaatg gacttcggca agcgtccgtt ctgagggggc ggggcgg^tc 
41461 cgaagttggc ggcgtgtgcc ctcagcgat ggggcggccc 